NUCLEOTIDE SEQUENCES FOR TRANSCRIPTIONAL REGULATION 
IN COR YNEBA CTERIUM GLUTAMICUM 



CROSS-REFERENCE TO RELATED APPLICATIONS 

This application claims priority to U.S. Provisional Application 
60/248,219, filed November 15, 2000, which is incorporated herein by reference 
in its entirety. 

STATEMENT REGARDING FEDERALLY-SPONSORED 
RESEARCH AND DEVELOPMENT 
Not applicable. 

BACKGROUND OF THE INVENTION 

Field of the Invention 

[0001] The invention relates to the areas of microbial genetics and recombinant 

DNA technology. The invention provides DNA sequences, vectors, 
microorganisms, and methods useful for inducing and regulating the expression 
of genes, including those that are involved in amino acid biosynthesis, in bacterial 
cells. 

Background Art 

[0002] Coryneform bacteria are Gram-positive bacteria frequently used for 

industrial-scale production of amino acids, purines, and proteins. Although 
coryneform bacteria, particular Corynebacterium species, have been widely used 
for industrial purposes for many years, the techniques of molecular biology have 
only recently been employed to augment the usefulness of these organisms in the 
production of amino acids and other products. 

[0003] One way to improve the productivity of a microbial strain is to increase 

the expression of genes that control the production of a metabolite. Increasing 



expression of a gene can increase the activity of an enzyme that is encoded by that 
gene. Increasing enzyme activity can increase the rate of synthesis of the 
metabolic products made by the pathway to which that enzyme belongs. In some 
instances, increasing the rate of production of a metabolite can unbalance other 
cellular processes and inhibit growth of a microbial culture. The modified culture 
will make more product per cell, but will not be able to generate enough cells per 
volume to show an improvement over the parent strain in a fermentor. 

Transcription is the process by which an RNA molecule is synthesized 
from a DNA template and occurs by the interaction of a multisubunit enzyme 
complex, known as RNA polymerase, with a DNA molecule. The RNA that is 
synthesized by this process ultimately directs the production of protein products 
within the cell. In general, the rate at which RNA is synthesized from DNA, i.e., 
the transcription rate, directly influences the level of synthesis of the 
corresponding protein product. 

Promoters are DNA sequence elements that regulate the rate at which 
genes are transcribed. Promoters can influence transcription in a variety of ways. 
For example, some promoters direct the transcription of their associated genes at 
a constant rate regardless of the internal external cellular conditions. Such 
promoters are known as constitutive promoters. In many cases, however, a 
promoter will direct transcription of its associated gene only under very specific 
cellular conditions. For example, promoters that turn off gene expression during 
the growth phase of a microbial culture, but turn on gene expression after optimal 
growth has been achieved can be used to regulate genes that control production 
of a metabolite. The new strain will have the same growth pattern as the parent 
but produce more product per cell. This kind of modification can also improve 
titer (g product/liter) and yield (g product/g glucose). Nucleotide sequences have 
been identified that can be used to increase or decrease gene expression in 
Corynebacterium species. These regulatable promoters can increase or decrease 
the rate at which a gene is transcribed depending on the internal and/or the 
external cellular conditions. Frequently, the presence of a factor, known as an 



inducer, can stimulate the rate of transcription from a promoter. Inducers can 
interact directly with molecules that, themselves, physically interact with the 
promoter or with DNA sequences in the vicinity of the promoter. Alternatively, 
the action of an inducer in stimulating transcription from a promoter may be 
indirect. Whereas inducers function to amplify the level of transcription from a 
promoter, there is a class of factors, known as suppressors, that reduce or inhibit 
transcription from a promoter. Like inducers, suppressors can exert their effects 
either directly or indirectly. 

[0006] Besides regulation through inducers and suppressors, certain promoters 

are regulated by temperature. For instance, a the level of transcription from a 
promoter may be increased when cells harboring that promoter are grown at a 
temperature that is greater than the optimum or normal growth temperature for 
that cell type. Similarly, there are promoters that will enhance gene expression 
in cells grown at temperatures below the normal growth temperature. 

[0007] Promoters are found naturally in wild-type cells where they regulate the 

expression of specific genes. Promoters, however, are also useful as tools of 
molecular biology in that they can be isolated from their normal cellular contexts 
and engineered to regulate the expression of virtually any gene. 

[0008] The use of regulatory sequences from Escherichia coli to control the 

expression of reporter genes in Corynebacterium have been documented. The 
lacl q repressor and the tac promoter/reporter genes from E. coli were on plasmids 
that replicate in Corynebacterium. See, e.g., Morinaga, Y. et aL, J. Biotechnol. 
5:305-312 (1987). In addition, Ben-Samoun et aL, FEMS Microbiology Letters 
1 74 : 1 25 ( 1 999), which is incorporated herein by reference in its entirety, disclose 
the use of the E. coli araBAD promoter and the araC activator on a plasmid 
which replicates in Corynebacterium glutamicum cells to stimulate the expression 
of the GFPuv reporter gene only when L-arabinose is present in the growth 



medium. The authors acknowledge, however, that the level of expression from 
the araBAD promoter in C. glutamicum is 6.5 fold lower than that which was 
observed in E. coli, the native species for the promoter. 

[0009] U.S. PatentNos. 5,693,781 and 5,762,299, each of which are incorporated 

herein by reference in their entireties, disclose the isolation of promoter 
sequences from the coryneform bacteria, Brevibacterium flavum. Sequences 
described in these patents were isolated on the basis of their ability to direct 
expression of a reporter gene in B. flavum at a level that was greater than the 
expression level observed in B. flavum with the synthetic tac promoter. Also 
disclosed in U.S. Patent Nos. 5,693,781 and 5,762,299, each of which are 
incorporated herein by reference in their entireties, are B. flavum promoters 
capable of expressing a reporter gene in B. flavum cells when grown in medium 
containing: (a) ethanol but not glucose, and vice versa; (b) glucose but not 
fructose; and (c) glucose but not casein hydrolysates/yeast extract/glucose, and 
vice versa. The novel promoter sequences disclosed in the present application are 
different from those described in U.S. Patent Nos. 5,693,781 and 5,762,299. 

[0010] A limited number of C. glutamicum promoters have been described to 

date. For example, the C. glutamicum aceA promoter is disclosed in Wendisch 
et ah, Arch Microbiol. 168:262 (1997) and in U.S. Patent No. 5,700,661 (where 
it is termed the isocitrate lyase promoter), each of which are incorporated herein 
by reference in their entireties. In both of these references, the aceA promoter 
was linked to a reporter gene in transformed C. glutamicum cells, and produced 
an extracellular protein, not a product of metabolic engineering. Expression of 
the reporter gene was found to be greater in C. glutamicum transformants that 
were grown in the presence of acetate than it was for transformants grown in the 
presence of glucose (Wendisch etal.,Arch Microbiol. 1 68:262 (1 997)) or sucrose 
(U.S. Patent No. 5,700,661). 

[0011] Similarly, the aceB promoter from C. glutamicum is disclosed in 

Wendisch et al., Arch Microbiol 168:262 (1997) and in U.S. Patent No. 
5,965,39 1 , which is incorporated herein by reference in its entirety. Both of these 



references describe transcriptional fusions consisting of the aceB promoter region 
linked to a reporter gene in C. glutamicum transformed cells. Expression of the 
reporter gene was found to be greater in C. glutamicum transformants grown in 
acetate-containing medium than it was for transformants grown in glucose- 
containing medium (Wendisch et ah, Arch Microbiol. 168:262 (1997)) or other 
carbon sources (U.S. Patent No. 5,965,391). 

Reinscheid et al, Microbiology 145:503 (1999), which is incorporated 
herein by reference in its entirey, discloses a transcriptional fusion between the 
C. glutamicum pta-ack promoter and a reporter gene (chloramphenicol 
aceryltransferase). C. glutamicum cells harboring the transcriptional fusion 
demonstrated enhanced reporter gene expression when grown in acetate- 
containing medium as compared to transformed cells that were grown in glucose- 
containing medium. 

In Patek et al, Microbiology 142:1297 (1996), which is incorporated 
herein by reference in its entirety, several DNA sequences from C. glutamicum, 
identified on the basis of their ability to promote the expression of a 
chloramphenicol resistance reporter gene in C. glutamicum cells, are disclosed 
and compared to one another in an attempt to define a consensus sequence for C. 
glutamicum promoters. 
| There is clearly a need for a broader assortment of well-defined 

Corynebacterium species promoters than has been heretofore described. Such 
promoters would be useful in the constitutive and/or regulated expression of 
genes in coryneform cells. For example, a collection of C. glutamicum 
promoters, regulated by inexpensive carbon sources, would facilitate the 
industrial-scale production of amino acids and purines in C. glutamicum cells by 
enhancing the expression of genes that encode components of the biosynthetic 
pathways for the desired amino acids or purines. Likewise, a versatile array of 
coryneform promoters would be useful for the industrial scale production of 
heterologous polypeptides in C. glutamicum cells by stimulating the enhanced 
expression of genes encoding such heterologous polypeptides. 



BRIEF SUMMARY OF THE INVENTION 



[0015] The present invention relates to isolated polynucleotides that function as 

transcriptional regulators, in particular, promoters, in Corynebacterium species 
host cells, preferably Corynebacterium glutamicum, Brevibacterium flavum, or 
Brevibacterium lactofermentum host cells, even more preferably C glutamicum. 
host cells. These promoters are useful for regulating and enhancing the 
production of a variety of products in such host cells. Examples of products, the 
production of which may be enhanced in Corynebacterium species host cells as 
a result of the present invention, are amino acids, such as lysine; purine 
nucleotides, such as inosinic acid; and heterologous polypeptides. Since 
Corynebacterium species are especially useful for the industrial-scale production 
of amino acids, purines, and polypeptides, use of the promoters of the present 
invention may greatly improve the yields of these products from 
Corynebacterium species host cells. 

[0016] In one embodiment of the present invention, an isolated polynucleotide 

is provided, comprising a first nucleic acid, the sequence of which is selected 
from the group consisting of SEQ ID NO:4 through 22. 

[0017] Further embodiments of the invention include and isolated polynucleotide 

comprising a first nucleic acid, the sequence of which is at least 90% identical, 
and more preferably at least 9 1 %, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% 
identical, to a sequence selected from the group consisting of SEQ ID NOs: 4 
through 22. 

[0018] Additional embodiments of the invention include an isolated 

polynucleotide comprising a first nucleic acid, the sequence of which comprises 
about 6 contiguous nucleotides, preferably about 1 0 contiguous nucleotides, even 
more preferably about 12, 15, 20, 30, 40, 50, 100, 150, 200, or 500 contiguous 
nucleotides, of a sequence selected from the group consisting of SEQ ID NOs: 4 



through 22, wherein the first nucleic acid is capable of regulating transcription of 
a second nucleic acid, preferably as part of a promoter. 

[0019] In one aspect of this embodiment, the promoter comprises any one or 

more of the following genetic elements: a minus 1 0 ("-10") sequence; a minus 35 
("-35") sequence; a transcription initiation site; an enhancer region; and an 
operator region. Preferably the genetic elements are specific for Corymb acterium 
species, more preferably are specific for Corynebacterium glutamicum, 
Brevibacterium flavum, and Brevibacterium lactofermentum, and even more 
preferably are specific for Corynebacterium glutamicum. 

[0020] Additional embodiments of the invention include an isolated 

polynucleotide comprising a first nucleic acid at least 1 0 nucleotides in length, 
preferably at least 12, 15, 30, 50 or 150 nucleotides in length, which hybridizes 
under stringent conditions to a reference nucleic acid, or the complement thereof, 
wherein the sequence of said reference nucleic acid is selected from the group 
consisting of SEQ ID NOs: 4 through 22. The meaning of the phrase "stringent 
conditions" as used herein is described infra. 

[0021] In further embodiments of the present invention, an isolated 

polynucleotide is provided, comprising a first nucleic acid as described above; 
and a second nucleic acid. According to this embodiment, the first nucleic acid 
comprises a transcriptional regulatory region, preferably, a promoter. 

[0022] In one aspect of this embodiment, said second nucleic acid encodes one 

or more polypeptides. Preferably, the physical location of the first nucleic acid 
relative to the second nucleic acid is such that, under the appropriate conditions, 
the first nucleic acid regulates transcription of the second nucleic acid, thereby 
facilitating production of the polypeptide. 

[0023] In another aspect of this embodiment, the second nucleic acid encodes one 

or more components of a purine biosynthesis pathway. One example of a purine 
biosynthesis pathway included in this aspect of the invention is the enzymatic 
pathway that results in the synthesis of inosinic acid. 



[0024] In another aspect of this embodiment, the second nucleic acid encodes one 

or more heterologous polypeptides. The heterologous polypeptide may be one 
that is from a Corynebacterium species or one that is from a non- 
Corynebacterium species. 

[0025] In one particularly preferred aspect of this embodiment, an isolated 

polynucleotide is provided, comprising a first nucleic acid as described above; 
and a second nucleic acid, which encodes a component of an amino acid 
biosynthesis pathway. Examples of amino acid biosynthesis pathways included 
in this aspect of the invention are the enzymatic pathways that result in the 
synthesis of L-glycine, L-alanine, L-leucine, L-methionine, L-phenylalanine, L- 
tryptophan, L-lysine, L-glutamine, L-glutamic acid, L-serine, L-proline, L- valine, 
L-isoleucine, L-cysteine, L-tyrosine, L-histidine, L-arginine, L-asparagine, L- 
aspartic acid, and L-threonine. 

[0026] In a preferred aspect of this embodiment, the second nucleic acid encodes 

one of the following components of the L-lysine biosynthesis pathway: 
aspartokinase, aspartate beta-semialdehyde dehydrogenase, diaminopimelate 
dehydrogenase, diaminopimelate decarboxylase, dihydrodipicolinate synthetase, 
dihydrodipicolinate reductase, or pyruvate carboxylate. 

[0027] Another embodiment of the invention provides an isolated polynucleotide 

comprising a first nucleic acid, the sequence of which is at least 90%, 91%, 92%, 
93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 2, and a 
second nucleic acid which encodes a polypeptide which functions as a component 
of the lysine biosynthesis pathway. In this embodiment, the first nucleic acid 
regulates transcription of the second nucleic acid. 

[0028] Yet another embodiment of the invention provides an isolated 

polynucleotide comprising a first nucleic acid, the sequence of which is at least 
90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID 
NO: 3; and a second nucleic acid which encodes polypeptide which functions as 
a component of an amino acid biosynthesis pathway, preferably a lysine 



biosynthesis pathway. In this embodiment, the first nucleic acid regulates 
transcription of the second nucleic acid. 

Further embodiments of the invention provide isolated Corynebacterium 
chromosomes with a first nucleic acid integrated into the chromosome, the 
sequence of which is one of the following: a sequence which is at least 90%, 
91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO:l ; 
a sequence of which is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 
98%, or 99% identical to SEQ ID NO:23, a sequence of which is at least 90%, 
91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID 
NO: 30, or a sequence which is identical to either SEQ ID NO:26 or SEQ ID 
NO:27. In the embodiments where the integrated first nucleic acid has a 
sequence identical to either SEQ ID NO:26 or SEQ ID NO:27, the isolated 
chromosome also has a third nucleic acid integrated therein, the sequence of 
which is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% 
identical to SEQ ID NO:28, operably linked to a transcription control region, 
wherein the nucleic acid encodes a polypeptide at least 90%, 91%, 92%, 93%, 
94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO:29. In these 
embodiments, the isolated Corynebacterium species chromosome also has a 
second nucleic acid integrated therein, wherein said second nucleic acid encodes 
polypeptide which functions as a component of an amino acid biosynthesis 
pathway, preferably, a lysine biosynthesis pathway. Additionally in these 
embodiments, the first nucleic acid regulates transcription of the second nucleic 
acid. 

)] In another aspect of the above embodiments, a Corynebacterium species 

host cell comprising a Corynebacterium species chromosome as described above 
is provided, preferably a host cell derived from Corynebacterium glutamicum, 
Brevibacteriumflavum, and Brevibacterium lactofermentum. In a related aspect, 
a method of producing such a Corynebacterium species host cell is provided, 
which comprises transforming Corynebacterium species cells with a vector 
comprising a first nucleic acid as described above, wherein said vector facilitates 
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integration of the first nucleic acid into the chromosome of said Corynebacterium 
species cells, and selecting the host cell in certain preferred embodiments, the 
vector also comprises the second nucleic acid described above, and/or the third 
nucleic acid described above, with the first nucleic acid being physically situated 
to regulate transcription of the second nucleic acid. These vectors are also 
provided. 

[0031] Examples of amino acid biosynthesis pathways which may be included 

in the above embodiments of the invention are the enzymatic pathways that result 
in the synthesis of L-glycine, L-alanine, L-leucine, L-methionine, L- 
phenylalanine, L-tryptophan, L-lysine, L-glutamine, L-glutamic acid, L-serine, L- 
proline, L- valine, L-isoleucine, L-cysteine, L-tyrosine, L-histidine, L-arginine, L- 
asparagine, L-aspartic acid, and L-threonine. 

[0032] In those embodiments where the second nucleic acid encodes a 

component of the L-lysine biosynthesis pathway, the second nucleic acid may 
encode one or more of the following enzymes: aspartokinase, aspartate beta- 
semialdehyde dehydrogenase, diaminopimelate dehydrogenase, diaminopimelate 
decarboxylase, dihydrodipicolinate synthetase, dihydrodipicolinate reductase, or 
pyruvate carboxylate. 

[0033] In another embodiment of the present invention, a method is provided for 

producing a vector. More specifically, the method comprises inserting into a 
vector any of the isolated polynucleotides described herein. 

[0034] In another embodiment of the present invention, a vector is provided 

comprising any of the isolated polynucleotides described herein. 

[0035] In one aspect of this embodiment, the vector further comprises a multiple 

cloning region into which a heterologous second nucleic acid can be inserted, 
thereby allowing for the regulated and/or elevated expression of a variety of 
second nucleic acids therein. 

[0036] In another aspect of this embodiment, said vector is a shuttle vector. As 

used herein, the term "shuttle vector" refers to a vector that can replicate and be 
maintained in more than one host cell species. In a preferred aspect of this 



embodiment, said shuttle vector can replicate and be maintained in a 
Corynebacterium species host cell, preferably a host cell derived from 
Corynebacterium glutamicwn, Brevibacterium flavum, and Brevibacterium 
lactofermentum, and in an E. coli host cell. 

[0037] In another embodiment of the present invention, a method of producing 

a transformed Corynebacterium species host cell is provided. The method of this 
embodiment comprises introducing into Corynebacterium species cells, 
preferably Corynebacterium glutamicum, Brevibacterium flavum, or 
Brevibacterium lactofermentum cells, a vector comprising any of the isolated 
polynucleotides described herein. 

[0038] In one aspect of this embodiment, the polynucleotide, after being 

introduced into the host cell, is integrated into the chromosome of the host cell. 
In another aspect of this embodiment, the polynucleotide, after being introduced 
into the host cell, is maintained as an extrachromosomal element; i.e., it does not 
integrate into the chromosome of the host cell. 

[0039] In another embodiment of the invention, a method is provided for the 

production of a biosynthetic product such as an amino acid, a purine nucleotide, 
or a heterologous polypeptide. According to this embodiment, a transformed 
Corynebacterium species host cell harboring a vector of the invention is used. 
More specifically, the vector comprises any isolated polynucleotide described 
herein. 

[0040] In a preferred aspect of this embodiment, the polynucleotide comprises a 

first nucleic acid which regulates transcription of a second nucleic acid, where the 
second nucleic acid encodes a polypeptide which functions in an amino acid 
biosynthesis pathway, as described above. In this aspect, the first nucleic acid 
regulates transcription of the second nucleic acid, thereby resulting in elevated 
production of the amino acid by the transformed host cell. 

[0041] In another aspect of this embodiment, the polynucleotide comprises a first 

nucleic acid which regulates transcription of a second nucleic acid, where the 
second nucleic acid encodes a component of a purine biosynthesis pathway. In 
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this aspect, the first nucleic acid regulates transcription of the second nucleic acid, 
thereby resulting in elevated production of the purine by said transformed host 
cell. 

[0042] In yet another aspect of this embodiment, the polynucleotide comprises 

a first nucleic acid which regulates transcription of a second nucleic acid, where 
the second nucleic acid encodes a heterologous polypeptide. In this aspect, the 
first nucleic acid regulates transcription of the second nucleic acid, thereby 
resulting in elevated production of the heterologous polypeptide by said 
transformed host cell. 

[0043] In certain preferred aspects of this embodiment, the first nucleic acid 

comprises a promoter. Preferred promoters include, but are not limited to the 
following types of promoters: a constitutive promoter, an inducible promoter, a 
derepressable promoter, a heat sensitive promoter, and a cold sensitive promoter. 

[0044] Where the promoter is an inducible promoter, the method for the 

production of a biosynthetic product as above may further comprise adding an 
inducer to the culture medium. The inducer may be present throughout the 
growth of the host cell, or alternatively, may be added to the culture medium after 
the host cell has grown to an optimal density. In a preferred aspect, the host cell 
is self-induced, i.e. , the inducer is produced by said host cell at some point during 
its growth cycle. When the inducer is present in the culture medium, the first 
polynucleotide facilitates enhanced production of the biosynthetic product. 
Inducers which may be used in the invention include, but are not limited to acetic 
acid, pyruvate, ethanol, starch subunits, sugars, cellulose subunits, fatty acids, and 
triglycerides. Sugars that may be used as inducers in the present invention 
include, but are not limited to fructose, maltose, lactose, and arabinose. 

[0045] Where the promoter is a derepressable promoter, the method for the 

production of a biosynthetic product as above may further comprise use of a 
culture medium which lacks a suppressor. In this aspect, the suppressor may be 
absent from the culture medium throughout the growth of said host cell, or the 
host cell may be initially cultured in a culture medium containing the suppressor, 
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and be transferred to a culture medium lacking said suppressor when the host cell 
culture has reached an optimal density. In certain preferred aspects, the host cell 
may be self-regulating, i.e., the suppressor may be removed from said culture 
medium through metabolic depletion by the host cell. When the suppressor is 
removed or depleted from the culture medium, the first polynucleotide facilitates 
enhanced production of the biosynthetic product. Suppressors which may be 
used in the invention include, but are not limited to purines, pyrimidines, amino 
acids such as histidine, and oxygen. 
[0046] Where the promoter is a heat sensitive promoter or a cold sensitive 

promoter, the method for the production of a biosynthetic product as above may 
comprise adjustment of the temperature of the growth medium to a temperature 
which is greater than or less than the optimal growth temperature for an 
untransformed Cory neb acterium species cell. For example, the yield of the 
biosynthetic product may be increased when said host cell is grown at a 
temperature adjusted to be greater than the optimal growth temperature for an 
untransformed Coryne bacterium species cell, or the yield of the biosynthetic 
product may be increased when the host cell is grown at a temperature adjusted 
to be less than the optimal growth temperature for an untransformed 
Corynebacterium species cell. 

DETAILED DESCRIPTION OF THE INVENTION 

[0047] The present invention relates to isolated polynucleotides that function as 

transcriptional regulators, in particular, promoters, in Corynebacterium species 
host cells, preferably Corynebacterium glutamicum, Br evib acterium flavum, or 
Brevibacterium lactofermentum host cells, even more preferably C. glutamicum. 
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host cells. These promoters are useful for regulating and enhancing the 
production of a variety of products in such host cells. Sources of promoters 
include nucleotide sequences from the 5' end of native chromosomal genes from 
Corynebacterium species, from sequences on plasmids that replicate in 
Corynebacterium species, from sequences in the genome of phage that infect 
Corynebacterium species, from sequences derived from other microorganisms, 
e.g., Escherichia coli, or from sequences assembled by humans (tac, trc) which 
are not found in nature. Genes of ribosomal proteins, ribosomal RNAs and 
elongation factors show high levels of expression. The promoters of these genes 
are candidates for increasing expression of amino acid bio synthetic pathway 
genes. 

Another reason for changing promoters of genes in biosynthetic pathways 
is to make the pathway independent of factors that control the pathway in the wild 
type organism. For example the native promoter of the operon that contains 
diaminopimelate decarboxylase of the lysine biosynthetic pathway of C. 
glutamicum can respond to arginine or lysine in the growth medium. Arginine 
increased transcription three-fold and lysine decreased transcription by one third 
(Oguiza,ef a/., JBact. 175:7356-7362 (1993)). Diaminopimelate decarboxylase 
activity decreased 60% in cells grown in minimal medium supplemented with 
lOmmM lysine (Cremer, et al, J Gen Microbiol. 134:3221-3229 (1988)). 
Replacing the promoter of lysA which encodes the diaminopimelate 
decarboxylase is one way to make lysine biosynthesis independent of arginine and 
lysine levels in media. 

Gene expression from many of the transcriptional regulators of the 
present invention can be easily and inexpensively regulated by manipulating the 
composition and/or temperature of the medium in which cells containing these 
promoters are cultured. Regulated expression from a promoter is especially 
useful for controlling the production of a gene product that is toxic to cells at high 
levels. To allow for maximum production of a gene product, a culture of cells 
possessing the gene of interest under the control of an inducible promoter can be 



-15- 



first grown to a sufficient cell density under conditions in which expression from 
the promoter is suppressed. Then, by manipulating the culture medium, 
expression from the promoter is induced. This strategy assures not only a high 
level of gene expression in the individual cells, but also a sufficiently high 
number of cells in the culture, so that maximum production of the gene product 
by the entire cell population is achieved. 

A. Definitions 

[0050] In order to provide a clear and consistent understanding of the 

specification and claims, including the scope to be given such terms, the 
following definitions are provided. It is also to be noted that the term "a" or "an" 
entity, refers to one or more of that entity; for example, "a polynucleotide," is 
understood to represent one or more polynucleotides. As such, the terms "a" (or 
"an"), "one or more," and "at least one" can be used interchangeably herein. 

[0051 ] Auxotroph. As used herein, the term refers to a strain of microorganism 

requiring for growth an external source of a specific metabolite that cannot be 
synthesized because of an acquired genetic defect. 

[0052] Amino Acid Supplement. As used herein, the term refers to an amino 

acid required for growth and added to minimal media to support auxotroph 
growth. 

[0053] Chromosomal Integration. As used herein, the term refers to the 

insertion of an exogenous DNA fragment into the chromosome of a host 
organism; more particularly, the term is used to refer to homologous 
recombination between an exogenous DNA fragment and the appropriate region 
of the host cell chromosome. 

[0054] Corynebacterium species. As will be understood by those skilled in the 

art, the terms "Corynebacterium" or "Corynebacterium species" includes those 
organisms previously identified in the literature as "Brevibacterium species," for 
example Brevibacterium flavum and Brevibacterium lactofermentum which have 



now been reclassified into the genus Corynebacterium. {Int. J. Syst. Bacteriol. 41: 
255 (1981)). Accordingly, the term "Corynebacterium species" is used herein 
interchangably with "Corynebacterium species and Brevibacterium species." 

[0055] Enhancers. As used herein, the term refers to a DNA sequence which can 

stimulate promoter activity and may be an endogenous element or a heterologous 
element inserted to enhance the level, i.e., strength of a promoter. 

[0056] Inducer. As used herein, the term "inducer" refers to molecule which 

acts to stimulate transcription from an inducible promoter. The inducer may be 
produced by a host cell, or added to a culture medium in which the host cell is 
being grown. 

[0057] Isolated Polynucleotide. As used herein, the term is intended to mean a 

polynucleotide, DNA or RNA, which has been removed from its native 
environment. For example, recombinant DNA molecules contained in a vector 
are considered isolated for the purposes of the present invention. Further 
examples of isolated DNA molecules include recombinant DNA molecules 
maintained in heterologous host cells or purified (partially or substantially) DNA 
molecules in solution. However, a nucleic acid molecule contained in a clone 
that is a member of a mixed clone library (e.g., a genomic or cDNA library) and 
that has not been isolated from other clones of the library (e.g., in the form of a 
homogeneous solution containing the clone without other members of the library) 
or a chromosome isolated or removed from a cell or a cell lysate, is not "isolated" 
for the purposes of this invention. Isolated RNA molecules include in vivo or in 
vitro RNA transcripts of the DNA molecules of the present invention. Isolated 
nucleic acid molecules according to the present invention further include such 
molecules produced synthetically. 

[0058] Lysine Biosynthetic Pathway Protein. As used herein, the term "lysine 

biosynthetic pathway protein" is meant to include those peptides, polypeptides 
or proteins, and enzymes, which are directly involved in the synthesis of lysine 
from aspartate. These proteins may be identical to those which naturally occur 
within a host cell and are involved in the synthesis of lysine within that host cell. 
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Alternatively, there may be modifications or mutations of such proteins, for 
example, the proteins may contain modifications or mutations which do not 
significantly affect the biological activity of the protein. For example, the natural 
protein may be modified by mutagenesis or by introducing or substituting one or 
more amino acids, preferably by conservative amino acid substitution, or by 
removing nonessential regions of the protein. Such modifications are readily 
performed by standard techniques. Alternatively, lysine biosynthetic proteins 
may be heterologous to the particular host cell. Such proteins may be from any 
organism having genes encoding proteins having the same, or similar, 
biosynthetic roles. 

[0059] Mutagenesis. As used herein, the term refers to a process whereby a 

mutation is generated in DNA. With "random" mutagenesis, the exact site of 
mutation is not predictable, occurring anywhere in the genome of the 
microorganism, and the mutation is brought about as a result of physical damage 
caused by agents such as radiation or chemical treatment. rDNA mutagenesis is 
directed to a cloned DNA of interest, and it may be random or site-directed. 

[0060] Mutation. As used herein, the term refers to a one or more base pair 

changes, insertion or deletion in the nucleotide sequence of interest. 

[0061] Operably Associated. As used herein, the term "operably associated" 

refers to a association of nucleic acid elements in a functional relationship. A 
nucleic acid is "operably associated" when it is placed into a functional 
relationship with another nucleic acid sequence. For instance, a promoter or 
enhancer is operably associated with a polypeptide coding region if it affects the 
transcription of the polypeptide coding region. Operably associated means that 
the nucleic acids being associated are typically close together or contiguous and, 
where necessary, join two polypeptide coding regions, contiguous and in reading 
frame. However, since enhancers generally function when separated from the 
promoter by several kilobases, some polynucleotide elements may be operably 
associated but not close together or contiguous. 



[0062] Operon. As used herein, the term refers to a contiguous portion of a 

transcriptional complex in which two or more open reading frames encoding 
polypeptides are transcribed as a multi-cistronic messenger RNA, controlled by 
a cis-acting promoter and other cis-acting sequences necessary for efficient 
transcription, as well as additional cis acting sequences important for efficient 
transcription and translation (e.g., mRNA stability controlling regions and 
transcription termination regions). 

[0063] Parental Strain. As used herein, the term refers to a strain of host cell 

subjected to some form of treatment to yield the host cell of the invention. 

[0064] Phenotype. As used herein, the term refers to observable physical 

characteristics dependent upon the genetic constitution of a host cell. 

[0065] Promoter. As used herein, the term "promoter" has its art-recognized 

meaning, denoting a portion of a gene containing DN A sequences that provide for 
the binding of RNA polymerase and initiation of transcription and thus refers to 
a DNA sequence capable of controlling the expression of a coding sequence or 
functional RNA. Promoter sequences are commonly, but not always, found in the 
5' non-coding regions of genes, upstream of one or more open reading frames 
encoding polypeptides. Sequence elements within promoters that function in the 
initiation of transcription are often characterized by consensus nucleotide 
sequences. The promoter sequence includes proximal and more distal upstream 
elements. Examples of proximal elements in bacterial promoters include a -10 
region and a -35 region, which are discussed in more detail, infra. Examples of 
more distal elements include operator regions and enhancer regions. As used 
herein, the term "endogenous promoter" refers to a promoter sequence which is 
a naturally occurring promoter sequence in that host microorganism. The term 
"heterologous promoter" refers to a promoter sequence which is a non-naturally 
occurring promoter sequence in that host microorganism. The non-naturally 
occurring promoter sequence may be from any prokaryotic or eukaryotic 
organism. A synthetic promoter is a nucleotide sequence, having promoter 
activity, and not found naturally occurring in nature. 
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[0066] Promoters may be derived in their entirety from a native gene, or be 

hybrid promoters. Hybrid promoters are composed of different elements derived 
from different promoters found in nature, or even comprise synthetic DNA 
segments. Hybrid promoters may be constitutive, inducible or environmentally 
responsive. 

[0067] Useful promoters include constitutive and inducible promoters. Many 

such promoter sequences are known in the art. See, for example, U.S. Pat. Nos. 
4,980,285; 5,631,150; 5,707,828; 5,759,828; 5,888,783; 5,919, 670, and, 
Sambrook, et al., Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring 
Harbor Press ( 1 989). Other useful promoters include promoters which are neither 
constitutive nor responsive to a specific (or known) inducer molecule. Such 
promoters may include those that respond to developmental cues (such as growth 
phase of the culture), or environmental cues (such as pH, osmoticum, heat, 
dissolved gases, or cell density). 

[0068] Examples of environmental conditions that may effect transcription by 

inducible promoters include anaerobic conditions, elevated temperature, reduced 
temperature, or the presence of light. It is understood by those skilled in the art 
that different promoters may direct the expression of a gene in different cell types, 
or in response to different environmental conditions. Promoters which cause a 
gene to be expressed in most cell types at most times are commonly referred to 
as "constitutive promoters". It is further recognized that since in most cases the 
exact boundaries of regulatory sequences have not been completely defined, DNA 
fragments of different lengths may have identical or similar promoter activity. 

[0069] Relative Growth. As used herein, the term refers to a measurement 

providing an assessment of growth by directly comparing growth of a parental 
strain with that of a progeny strain over a defined time period and with a defined 
medium. 

[0070] Stringent Hybridization Conditions. As used herein, the term "stringent 

hybridization conditions" is intended to mean overnight incubation at 42 °C in a 
solution comprising: 50% formamide, 5x SSC (750 mMNaCl, 75mM trisodium 
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citrate), 50 mM sodium phosphate (pH 7.6), 5x Denhardt's solution, 1 0% dextran 
sulfate, and 20 g/ml denatured, sheared salmon sperm DNA, followed by washing 
the filters in O.lx SSC at about 65 °C. 

[0071] Suppressor or Repressor. As used herein, the terms "suppressor" or 

"repressor" refer to molecules which act to block or reduce transcription from an 
derepressable promoter. The suppressor or repressor may be produced by a host 
cell, or added to a culture medium in which the host cell is being grown. 
Furthermore, during the growth of a host cell, the suppressor substance can be 
metabolized by the host cell, thereby removing it from the culture medium, 
thereby increasing transcription from the derepressable promoter. 

[0072] Transcription factor. As used herein, the term "transcription factor" 

refers to RNA polymerases, and other proteins that interact with DNA in a 
sequence-specific manner and exert transcriptional regulatory effects. 
Transcriptional factors may be transcription inhibitory proteins or transcription 
activator proteins. In the context of the present invention, binding sites for 
transcription factors (or transcription complexes) are often included in the 
transcriptional regulatory element(s). 

[0073] Transcription factor recognition site. As used herein, a "transcription 

factor recognition site" and a "transcription factor binding site" refer to a 
polynucleotide sequence(s) or sequence motif(s) which are identified as being 
sites for the sequence-specific interaction of one or more transcription factors, 
frequently taking the form of direct protein-DNA binding. Transcription factor 
recognitions sites which bind transcription inhibitory proteins or transcription 
activator proteins are also referred to as "operator regions." Typically, 
transcription factor binding sites can be identified by DNA footprinting, gel 
mobility shift assays, and the like, and/or can be predicted on the basis of known 
consensus sequence motifs, or by other methods known to those of skill in the art. 

[0074] Transcriptional Complex. As used herein, the term "transcriptional 

unit" or "transcriptional complex" refers to a polynucleotide that comprises one 
or more coding regions for polypeptides, a cis-acting linked promoter and other 
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cis-acting sequences necessary for efficient transcription of the structural 
sequences, distal regulatory elements necessary for appropriate transcription of 
the structural sequences, and additional cis sequences important for efficient 
transcription and translation (e.g., mRNA stability controlling regions and 
transcription termination regions). In bacteria, a transcriptional complex may 
comprise a single coding region, or one or more coding regions, e.g., as part of 
an operon. 

Transcriptional Regulatory Element. As used herein, the term 
"transcriptional regulatory element" refers to a DNA region which activates 
transcription alone or in combination with one or more other DNA regions. A 
transcriptional regulatory element can, for example, comprise a promoter, 
response element, negative regulatory element, silencer element, gene 
suppressor, transcription terminator, and/or enhancer. 

B. Microbiological and Recombinant DNA Methodologies 

The invention as provided herein utilizes some methods and techniques 
that are known to those skilled in the arts of microbiology and recombinant DNA 
technologies. Methods and techniques for the growth of bacterial cells, the 
introduction of isolated DNA molecules into host cells, and the isolation, cloning 
and sequencing of isolated nucleic acid molecules, etc., are a few examples of 
such methods and techniques. These methods and techniques are described in 
many standard laboratory manuals, such as Davis et al, Basic Methods In 
Molecular Biology (1 986), J.H. Miller, Experiments in Molecular Genetics, Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, New York (1972); J.H. 
Miller, A Short Course in Bacterial Genetics, Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, New York (1992); M. Singer and P. Berg, Genes & 
Genomes, University Science Books, Mill Valley, California (1991); J. 
Sambrook, E.F. Fritsch and T. Maniatis, Molecular Cloning: A Laboratory 
Manual, 2d ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New 
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York ( 1 989); P .B . Kaufman et al. , Handbook of Molecular and Cellular Methods 
in Biology and Medicine, CRC Press, Boca Raton, Florida (1995); Methods in 
Plant Molecular Biology and Biotechnology, B.R. Glickand J.E. Thompson, eds., 
CRC Press, Boca Raton, Florida (1993); and P.F. Smith-Keary, Molecular 
Genetics of Escherichia coli, The Guilford Press, New York, NY (1989), all of 
which are incorporated herein by reference in their entireties. 
[0077] Unless otherwise indicated, all nucleotide sequences newly described 

herein were determined using an automated DNA sequencer (such as the Model 
373 from Applied Biosystems, Inc.). Therefore, as is known in the art, for any 
DNA sequence determined by this automated approach, any nucleotide sequence 
determined herein may contain some errors. Nucleotide sequences determined 
by automation are typically at least about 90% identical, more typically at least 
about 95% to at least about 99.9% identical to the actual nucleotide sequence of 
the sequenced DNA molecule. The actual sequence can be more precisely 
determined by other approaches including manual DNA sequencing methods well 
known in the art. 

C. Polynucleotides 

[0078] Certain embodiments of the present invention are directed to an isolated 

polynucleotide comprising a first nucleic acid, the sequence of which is related 
to, or identical to, a nucleotide sequence, or fragment thereof, selected from the 
group consisting of SEQ IDNO:l, SEQIDNO:2, SEQIDNO:3, SEQIDNO:4, 
SEQ ID NO:5, SEQ ID NO:6, SEQ IDNO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ 
ID NO:10, SEQ ID NO:ll, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, 
SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID 
NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ 
ID NO:24, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, and SEQ ID NO:30. 
SEQ ID NOs 1-3 have been previously described. See, e.g., Reinscheid et al, 
Microbiology 145:503 (1999); U.S. Patent No. 5,700,661; and U.S. Patent No. 
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5,965,391. SEQ ID NOs 23, 24, 26, 27, 28, and 30 have been previously 
described. See, e.g., Ben-Samoun et al., FEMS Microbiology Letters 174: 125- 
130 (1999), Brosius, J. etal, J. Biol. Chem. 260, 3539-3541 (1985), Amann, E., 
etal, Gene 69: 301-315 (1988), Moeckel, etal, J.Bacteriol. 174:8065-8072 
(1992 ), Keilhauer, et al. J.Bacteriol 175:5595-5603 (1993) and Patek et al, 
Appl. Env. Microbiol, 60:133-140 (1994). Isolation of polynucleotides 
comprising nucleic acids the sequence of which are identical to SEQ ID NOs 4-22 
are described herein. While not being bound by theory, it is believed that 
polynucleotides having these sequences can be used to regulate gene expression 
in Corynebacterium species. The genes regulated by SEQ ID NOs 4-22, 
determined by similarity of the genes to genes identified in other organisms, are 
listed in Table 1 A, along with putative exogenous regulatory molecules. SEQ ID 
NOs 1-22 are listed in Table IB. 



TABLE 1A 

Nucleotide sequences that can be used to regulate gene expression 



Seq. ID. NO: 


Gene* 


Regulatory Molecule* 


1 


pta 


acetate 


2 


aceA 


acetate 


3 


aceB 


acetate 


4 


adh 


ethanol 


5 


aldB 


ethanol 


6 


poxB 


pyruvate 


7 


ldh 


pyruvate 


8 


amyE 


carbon 


9 


malZ 


carbon 


10 


bglX 


carbon 


11 


gam 


carbon 


12 


glgX 


carbon 


13 


hisD 


histidine 


14 


pyrR 


pyrimidine 


15 


purD 


purine 


16 


hrcA 


temperature 


17 


htpX 


temperature 
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1 8 dnaK temperature 

19 etc temperature 

20 grpE temperature 

21 clpB temperature 
22 narA oxygen 



Sequence I.D. NOs 1, 2, and 3 have been previously described. The remaining 
sequences were discovered in ADM's Corynebacteriumglutamicum genome sequencing 
project. 

* Putative genes regulated by sequence I.D. NOs 4-22 were determined by homology 
to genes identified in other organisms, e.g., Escherichia coli or Bacillus subtilis. 
t Putative regulatory molecules associated with the regulatory regions of SEQ I.D. NOs 
4-22 were determined by analogy to regulatory regions identified in other organisms. 
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Table IB 

Nucleotide sequences that can be used to regulate gene expression 



SEQ ID NO: 1 

CACCGACAACGGCAACACGCAAAGGGCGAGACATATAAAGTTCGATTCCTTAAAGGGGTTCTAAAAAATG 
TGGAGTATGTGAGCGGGGTTCCACTAGTAGATTCGACTCCTATCGGGGTGCGACTGCTAATGGTGCCCTG 
CTATCAACCCTCCATGATACGTGGTAAGTGCAGACTAATAAAGGCCAGTCGGGGAGGATTGGGGGCTTTG 
CTGGGGGCAGATTTGTCACGCTGCGCGCTTTCATAGACCCCATTAATGTGGGGTGAAGAGCTGTAAAGTA 
CCGCTAAAAACTTTGCAAAGGGTGCTTCGCAACTTGTAACCGCTCCGTATTGTTTTCTACGGCAATAAGC 
ATTTGTGCTGCTCAAAGCGTGGAATTGAGATCGGTTTGAAAATTACAAAATAAAACTTTGCAAACCGGGC 
TGTACGCAAGGCGGACGAACGCTAAACTATGTAAGAAATCACAACTTCCCCTCAGTAGTGCCAGGAGGCA 
CAAGCCTGAA 



SEQ ID NO:2 

ACTCTTTTAAGAAAAGCACTCTGACTACCTCTGGAATCTAGGTGCCACTCTTCTTTCGATTTCAACCCTT 
ATCGTGTTTGGCGATGTGATCAGACTAAGTGATCACCGTCACCAGCAAAAGGGGTTTGCGAACTTTACTA 
AGTCATTACCCCCGCCTAACCCCGACTTTTATCTAGGTCACACCTTCAAAACCTACGGAACGTTGCGGTG 
CCTGCATTTTCCCATTTCAGAGCATTTGCCCAGTACATCTGTACTAGCAACTCCCCCGCCCACTTTTTCT 
GCGAAGCCAGAACTTTGCAAACTTCACAACAGGGGTGACCACCCCCGCACAAAACTTAAAAACCCAAACC 
GATTGACGCACCAATGCCCGATGGAGCAATGTGTGAACCACGCCACCACGCAAACCGATGCACATCACGT 
CGAAACAGTGACAGTGCATTAGCTCATACTTTGTGGTTGGCACCGCCCATTGCGAATCAGCACTTAAGGA 
AGTGACTTTG 



SEQ ID NO:3 

TTGCGTGGTGGCGTGGTTCACACATTGCTCCATCGGGCATTGGTGCGTCAATCGGTTTGGGTTTTTAAGT 
TTTGTGCGGGGGTGGTCACCCCTGTTGTGAAGTTTGCAAAGTTCTGGCTTCGCAGAAAAAGTGGGCGGGG 
GAGTTGCTAGTACAGATGTACTGGGCAAATGCTCTGAAATGGGAAAATGCAGGCACCGCAACGTTCCGTA 
GGTTTTGAAGGTGTGACCTAGATAAAAGTCGGGGTTAGGCGGGGGTAATGACTTAGTAAAGTTCGCAAAC 
CCCTTTTGCTGGTGACGGTGATCACTTAGTCTGATCACATCGCCAAACACGATAAGGGTTGAAATCGAAA 
GAAGAGTGGCACCTAGATTCCAGAGGTAGTCAGAGTGCTTTTCTTAAAAGAGTTTTCACAACCGTTAACG 
GCGTAGCCAAACAAGAAGGATTCGCATTCTTCTGGTTTAGGCACAGGTCATCTAAACCCATGCTTTAAAA 
GGAGCCTTCA 



SEQ ID NO:4 

ATTGGAAAATGGAGATTAGAAGAATCCTGGGAATGTTGGTGTGTGTTGCATGTCTGTTGAGACTATCTAG 
TAGATGCGGCTGCGCTCCTTAATTGCATGCTGGGGTGGTGGGGAATGGGTGGTTGGGGGCGTCGAAAAGC 
ATTTTTGGTGCTTCTAAGGGAATTGTGTGAATCTTGGAAAGCTAATTGAAAAACATTCCCATTAGTGGGT 
GATTTGCTGGAGTTTTGTGAATCTATTTTTCGAAATTTCAGCGTGCGGGGGTGGTTTGTTTTTTTACAAT 
TGCCAGTTCATTCACGGTTGTTGAAATGTTCGGGGGTAATAACTCAACTTTCTATTTTCACCTTGTTGGG 
ATTTCGCTAGGGTGGACGATGGCAGCAATTGAATGTTGTAAATCACACAATTGCAAGGATTGTAATTTAA 
GGCACATCTATGTCGGTGTGAAATTACATGTGCCAGAAGAGCAATTTGCCAAGTAATCCAAGCGAGAAGG 
AGTGAGTTTT 



SEQ IDNO:5 
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CGATCATCGAACTCGGCGAAGAAAACCGAAACCTCAAAGAATCCCTGCGTAAGGTCACAGCTGAGAATGA 
GCAGCTCAAAGATCAATTACGCAGCGGGCGTCCGCGTGGCGAGCTGGTGCACGTGCCCCGCTCCACCGCG 
GTGGTCATGTGGGAACGCCGCAAGGGGCGTTCCAAGTAAAAACATGCTTGTCGACGCCGCTTTCTAGCAA 
ATTAAGCGGGCACCTCCATTTATCTTTTGGAGGTGCCCGTTTCGTGCTTTCGCCAATTAGATACATGCAT 
AACCACCCGAACAGGGGTAATAACTTTTGAAAGGCTTTCGGCGTTGAGCTGCGAGAATTTTGAGAAAAGG 
GGGTGAATTTAACAGGGGTTCTAGCGCGGATTGATTTTCGTGAATATGGTGGCTGCTAAGCGTGCGAATG 
TGCGCGTTATCACAATCGTTGACCAAGTGTCACCTGACGCACAGGTAGTGCTCAGGTGGAGGTGGCCCAA 
AGGAGACCCA 



SEQ ID NO:6 

TTTTAGACCACGGCGCTGTGTGGGGATTTAAGACGTCGGAAATTGTAGGGGACTGTCAGTGTGAGTCGGG 

TTCTTTGAGGCGCTTAGAGGCGATTCTGTGAGGTCACTTTTTGTGGGGTCGGGGTCTAAATTTGGCCAGT 

TTTCGAGGCGACCaGACAGGCGTGACAAGATTGACTAAAAAACCGAAGTTTTGGCACGTGTGTTTGGTTT 

CTCGGGGTCTAAACCGGACAGGCGTGACAAGATCTGGCGAAATCGCAGGTTTTTGTCACGCCTGTCTGGT 

TTTACCTTTTGGGGGCCCGAACTGCCCTGAACTACTCGGATCGACCAAGCAGTTTGGCCTCCAGCGCTCT 

GATCAAGCACCCAACCGCCTCTAAATCACACCAAGGCACTCGTAAAACCCGTGGCAGATAGAGAAAGTGT 

GGCAGCAACTCGAATTGAAGAGCACAATTGAAGTCGCACCAAGTTAGGCAACACAATAGCCATAACGTTG 

AGGAGTTCAG 



SEQ ID N0:7 

AAAACAGCCAGGTTAGCGGCTGTAACCCACCACGGTTTCGGCAACAATGACGGCGAGAGAGCCCACCACA 
TTGCGATTTCCGCTCCGATAAAGCCAGCGCCCATATTTGCAGGGAGGATTCGCCTGCGGTTTGGCGACAT 
TCGGATCCCCGGAACCAGCTCTGCAATGACCTGCGCGCCGAGGGAAGCGAGGTGGGTGGCAGGTTTTAGT 
GCGGGTTTAAGCGTTGCCAGGCGAGTGGTGAGCAAAGACGCTAGTCTGGGGAGCGAAACCATATTGAGTC 
ATCTTGGCAGAGCATGCACAATTCTGCAGGGCATAGATTGGTTTTGCTCGATTTACAATGTGATTTTTTC 
AACAAAAATAACACTTGGTCTGACCACATTTTCGGACATAATCGGGCATAATTAAAGGTGTAACAAAGGA 
ATCCGGGCACAAGCTCTTGCTGATTTTCTGAGCTGCTTTGTGGGTTGTCCGGTTAGGGAAATCAGGAAGT 
GGGAT CGAAA 



SEQIDN0:8 

CCAGATCTCGGGATGCTTCGTAGATCAGCATGGCGCCGAGGTCGTTGGCTGCTGCACGGAAAGCTGCGTT 
GTCGCTGCGCTCGTCGCGCAACAGGGGTAGGCGGCTAGCAACGAGTGGGTGGTTGACGATGGTGATGTCC 
ATAGCTTCACATGTTAAATCATTGCCGCCCAGAAGAAGACCGCGCGGGCGAATTTGGGCTTGGAGGGAAC 
CAAACGGCCACTTTTCCAGTCCAACAAAGTATGAGGATTAATTTGCCCCACTCCAAAGAGCTCGCCCACG 
AGCTGTGTTTGTTGCCCACCCCTGCTGTGCCCGCGCTTCCCACTGATTCTGGCGCGCAGTTTGATATCCA 
CCAGGCACTGTCCGCCTCTCTTGCCACCTATGCCCGCAACCTCACCTTGCTGTCCCACACCGCCGAGAAT 
TTAGGAAACCGCGCTCTGACCGGCCTCGCTGAAATCGAAGACACCGACGACCAACTCGCACACGCATTGG 
AGCGCCTGAC 



SEQ ID NO: 9 
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GGGCATCATGGGCATTGTTGGTGGATTCACCATGCCTGTCGCAATCGCTAGGACCAGGGATAAGAACCTC 
GTGTGGTTCCCCGTGGTCTTTGGTGCATCGATGTTCCTCGGTTATGTGGGAACGTGGCTGTGGCCGTCCC 
AAGGCTGGTACCTGTGGTCATTCCTTCTTGGTTTAGGTGGACTCTGCTTCCCGATGGCTATCGCCCTGAT 
TCCAGCGCGTACGAAAGATCCGAGAATTACCGCAAGCTTGTCTGGATTTGTGCAGCCGGTGGGTTACATT 
CTTGCAGCCCTTGGGCCATTGGCAGTGGGAGCGATCTACCAGGCGATTGGCTCCTGGTCAGAGATCCTCG 
TTGGTTTGGCCTTGGGCACAATAGTGTTGTCGATTGTGGGATTCAGAGCAGCACGCAATGTGACGGTTGA 
TGATGAATTGAGGAGATCAAAGTAGCCTCAACTAAGCGTCGCGATAAGAACGAGGGGCAAGGCTGATGTA 
CTCTGTCAACCATGGATAAACCGGTCGTGAGGGATGCAGCTCTGCTGATTTTTCGCGCTGTGCTCGGAGT 
GATCTTTGTGGCACACGGGTGGGAAAAGCTGTTCATCTCCGGAGTTACCAAGACAACAGGACAATTTTCA 
GCCTGGGGAGTGCCTCAACCCAAGCT CTCGGTGTGGATCACATCGATCTCTGAGCTGCTCGGTGGTGCCT 
TCCTAGTGGTTGGTTTGCTCACCACCTTTGTTGCTGGTGCATTAGCGCTGTTGATCGCCGCTGCTATTTA 
CTTTGTGCACTTGAGTTCGGGCTTTTTCACAGTTGATAACGGCATCGAATTCCCCTTGCTCATCATTGTT 
TCTTTGCTCGTGATCGTTGTGTTTGGTTCTGGTAGAGCCAGCGTTGATGGGGTGCTCACGCGTGGTTGAC 
TGTAGTGCCATTCAAGCCGCGCTGTCCGCCAAATTAGATGGTGAGCCGACAGGCTTGGATGATGCAGTAA 
TTGAAGCGCACCTCGCTAATTGTGAAGAGTGCAGAAATTACTACAACCGTGCTGCTGAGTTGAATCGGAT 
GCTCAATTTTTGCGCCGCGGAACCTCGCACCCTGACCCCGCCTGATCTATCAGAGATCATTCTGGCAGAG 
GTGGAACCAGAATGGCGCAGGCATGCCAACGCCAAGGTTGTGGGATCCCTGCTATCGCGAGTGTTGTTGG 
TGATCCTGGGTGTGGTTTACCTCGCCTGGGGTATCACAATGTTGGGGGATTCGGCGTCGATAAGCGTCCA 
AGAAGACCCGCTCACCTCGCGCCTGCTCGCGGAGGCCGTTGCCTACCGCATTGCTCTTTCTGTGGGGCTG 
TTATTTGCGGCGTGGAAGCCGCGGATTATCGCGGGCATGCTCCCGATTTTTGGAACGTTGTGGACATTTA 
GTGCTGGTTTTGCTGCGCGCGATCTCGTGTTTGGCGTCGCCGATTCACAGACGGGACTGTCCATTGGTCT 
GCTGTTGATTTCTACGATTGTGCTGTCGTTTGCGTTGGTGAATAGTTCTGGACCGGGTATTTTGCGGCGC 
ACATGGAACTCATTGAACGCCGCGCCCGGCTAAGGTGGGAGGC 



SEQIDNOrlO 

TAATACGGCATCCGGCCTGAAAGGACAGTGTGGAACCCAAAAATCATGCCCTCGCAGAATCGTGTTTAAG 
GGGTTAAAACGCCTCGACCCACACTCTGACCCATCCGTGAAACTAGACCTCTTAAAACGACTTCTGGTTG 
GTGAGGTGTGAAAACCGCACTGCGGGGCCCAATTCACAAAAATCCATACGAGTGTGCACACCCGATTTTC 
ACATCGCTTCGAGACCTCCCTTTTTGACACCTTAATTGTCTAACCCCGTATAGGTGAGAAATGTTGGACA 
AGTGTCTGTTTTTGTGGGGGGAATCTGACTACGATGGTAAGAAACAGGGAAAGGGGTTACCATTATGTCT 
CAAGAGTAGCCTCAAATCGGCTCCCGCCTCTCTCGTGTCATTGAACAAGACGGCCTACAATTCCGCGATC 
TCGACGGCGACGGCGTACTTGCACCTTATGAAGATTGGCGTCTAACCCCAGCAGAGCGTGCCGCTGACCT 
GGTGAAACGA 



SEQIDNO:ll 

CACCGCGTCGCCGATCTCTCCCTTCCAGACTCAGAAGTTGCTTCGGCAATTGCACTTGAGTGGAAACCAG 
GCGTCGGATTCACTACTTGAGCTGCTGATTTGTAGGTTTTAAGACCTTGAAGATATAGTTAATTCTCGTT 
GCAAGGAACCAGATTCCAAGCAATGCGTATTCCTCCATAGCTCAGTTGGCAGAGCATTCGACTGTTAATC 
GAAGGGTCACTGGTTCGAGCCCAGTTGGAGGAGCAAATTGAAACCCACTGTTTTTTAACAGTGGGTTTTT 
TTGCATGTTTCATACAGTTAACGAACCCCAATTTTGTAACTTCCACTCTCCTAGCTATGATGAATACTCG 
TTGCAAGGAAGTAATTCCTTTCCAATACTTATTCCTCCATAGCTCAGTTGGCAGAGCATTCGACTGTTAA 
TCGAAGGGTCACTGGTTCGAGCCCAGTTGGAGGAGCAATACACACAGCCCGCCGTTTTTCTTAAAACGGC 
GGGTTTTGTT 



SEQ ID NO: 12 
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GGTGTCAGTGTATCCGCGAACCAGAGCCTCGATGGCAGCTGTGTCTTCTGGGGAAGCGGTCTGGCCGTTT 
TCAACTGGCGCAGCTTCAGCGAAGGTAATTGGGTTCTCAGACAGCTCTTGTGACAGTACATCAAGCTGTG 
CTTGCTGCTCTTCGTTGATGGTTGTTGCTTCGGAAGATTCAGATTCAGATTCAGATTCGGACGAGGTGGT 
TTCTGCTGCTTCCGCCGAGGTCGTTGCGCTGGAAGAGGAAGAATTTGTCGTCGTCGCTGCGCTGGATGAG 
GCTGCTTCTGTATCAGATGACTCACTGCTACATGCGGTTAAGAGAAGTGGGGTGACCATGAGTGCTGCGA 
AAGCAGCCTTCTTTGAGGAAAGGCGAATAGACAAAGTTCTGCTCCTGATAAATCATCGACATGCTCCGGA 
AAACTTAAAAATTCCCGGACGGTTCACGCAGATTACCCTAGCAAAGCAATCTAGCTGACGACCCAATTTA 
GTCCTGTCATTATGCTGGCAATTGTGCAGCTATCTAAAGAATCTATTATTGGGGCAGCCGTTTCGATCCT 
GAGCGAGTTCGGTTTGTCGGATATGACCATGCGTCGCGTCGCAAAGCAATTAAATGTCGCGCCGGGCGCG 
CTGTATTGGCATTTTAAAAATAAGCAGGAGCTTATCGACGCCACCTCACGCCATCTCCTGGCGCCTATCT 
TGGGGCGCAACGACGAGCAGCGAGCAAGCATTTCCGCGCAGGAAACGTGCGCAGAAATGCGTTCACTGAT 
GATGCAAACCAAAGACGGTGCGGAAGTCATCAGTGCCGCACTGAGTAATCAGCAACTGCGCCAAGAATTG 
GAATCTCTCATTTCCGACTCTTTAAAAGAACCTAATGAGGTCGGTGCTTTTACGCTGCTGCATTTTGTGG 
TGGGTGCAGTATTAACAGAACAAACTCAGCTGCAGATGCACGAGTTCACGGCTGGCGCGGAAGATGACAC 
ACAAGAAAACCCTGCCGATGCGAACTTTGAGGAGAGATTCAATCAAGGATTAGAAATCATTCTGGCGGGT 
CTAGACGCGCTTGGGCATATAAGATAGCGTTCT 



SEQIDNO:13 

AGTGACAGCTCGCGCCGCATCGTTGATGGCAAACATGTCCAAATTAAGGCGCATGCGACCAAGGATGGTG 
AGCATTTGCCCTGATTCGTGGCTGAAGTACATCGAAATGCGGTGATCTTGCCACGGCACGATGATGCGAT 
CTTCTGAGCTCAAATAGTGGTAGCCCAAGGAATCAACAGCCTCCGTCACTCGATTCAGGTCAACTGGGAA 
AGGAATGGAGGTGTTGGGTGCAGGGACGTTTCGATCATTCACAGAGGTGAAATCCATTCCTTCCAGTATC 
TCAAAGGTGAAAGCGGGTTTAATTCAGGTAAATCTGGGGTGGTCATTTTAAGTTTTAAGTCTAATTCAAA 
TGAACTCTGATGTACCCAAATCAGAAACTTGTTACGTGGGGAATACAATAGGTAAATATGCGGGCTTAAG 
AACTTGTGTTGAGGCCGCTTGGATTCGGGCACCGAGCTCGAAGAATTTCGATTCAACCTTTTAAGGGAGA 
ACTTTTCGCC 



SEQ ID NO: 14 

CGTGGCGTGCCACCCATTTAAGTCCCGCGGGGAGACTGAAGATGGTGGTGCCGTCGGCGAGGCGTTCTGC 
CTGCGTAATGGGGTTAAGTGGGATGAACGGGGGAGTCAGACGTGCGACAGCGCCCTTGCGGGTATGCCAA 
TCCCAGACCATTTCTCGGGGAAAAGGAATAAAATGGCTTGTGGTCAGACTCACAGGGGCTTCTCCAAGTC 
AGTGGATTTATGAGGTCCCAGTGGGTACACACCGGGTGTCCTACAACGATCAATTGTCACAGATTCGACT 
GGCATGCTGTACCATCTGCTTTAAGCATTTTGGTGTTTCACTGTTGTTAACAGTGTTTCACCGTGGAGCA 
CTACCTTAGATCATAGTCAGCATCTTGGGGTGAATGTGACACGGTACGCTATAGTGTCAGACAACAACCA 
GGAAACTGGTCGTTGCAGAGTTTTTGCAAAATTGGACATCCTTTAACGGACCGCACAGAGAGGCGGGGAA 
GGAGGTCACG 



SEQ ID NO: 15 

GCCATGGCGTTGCGGAAATCGTAATCGGCCATTTTGTCGGTGGGGAAGAGGTGGATGTGAGTGTGGGGAA 
CATCGAATCCTGCGATGATGTAACCACATCGAGGGGCGTCGAATGCTGTGCGGATTGCATTTCCGATGAG 
CTGGGAGGCCTCGTTTACTTCGCTCCAGATGTTCTGAGGAAGGTCGGTCCAGCGGTCAACTTCTGCAACG 
GGTACGACTAGGGTGTGGCCGTAGGTGAGGGGTTCGATGGATAGAAAAGCCACGACATTCTCGGAACGAT 
ACACAAATCGGCCGGGGAGCTCGCCATTAATAATTTTCGTGAATACAGAAGCCATATGCACAGACTACTA 
CTTGGCGTGCAACCAAATTGAGGTTGCATAAAATAATGCAGGTGAGCCGGTCATTTTAAGGCGCTTTTCG 
ACGCCACTTTCAACCATTTCCGAACCGCCAAGAATACTGGAATAGCTTGGATCAAGTTTTGCAGGATAAA 
CTGTGCAACC 



SEQIDNO:16 
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ACGCTGATCGTTTTGAGCTTATCGACGCTCGCCTGCGCTCAGCTGGTTTCGATTGGTACGAGGTATCCAA 
CTGGGCGAAACCCGGCGGAGAATGCAAGCACAACATGGGCTATTGGGTCGACGGCGACTGGTGGGGTGCT 
GGCCCGGGCGCGCACTCGCACATCGGCGACCGCCGCTTCTACAACATCAAGCACCCAGCGCGTTACTCCG 
CGCAGATTGCGGCCGGCGAGCTGCCCATTAAGGAAACAGAGCGGCTGACGGCGGAAGATCACCACACCGA 
GCGCGT CATGCTTGGTTTGCGCCTGAAACAAGGCGTGCCGCTGAACCTTTTCGCACCCGCAGCGCGCCCG 
GTCATCGACCGTCATATCGCAGGAGGCCTGCTGCACGTCAATGCGCTGGGCAACCTGGCGGTGACCGATG 
CGGGACGTTTGCTTGCCGACGGCATCATCGCCGACATTTTGCTTAGTGAAGAAGACTAAATATTTAGTAG 
GGTTACAGAC 



SEQ ID NO: 17 

TGGCGACTATGCCACATAGTCGACTACCTTGCATAGTTGACTATTTGATTGAGTTGAACTGTGTCAGTGT 
ATGAAGAGAAAAATGAAGAGAAAAACAGCTGCACATGGTTTCAAAAGATAAGGAAGTGAAGCATGAGCAT 
CGAGCCAGGAATCCCCACGCTTGGACCGCTTGAAGAACAAGTCATGCACATTCTGTGGGATCACGGAAAA 
TTGACAGTCCGTGAAGTCATCGAATTCCTTCCAGGTGATCCTGCGTACACAACGATCGCAACCGTCCTGC 
GTCACTTGGGCAGAAAAGGCATGGTCACCATTGTGAAAGATGGTCGGACTGCTCGACACAGCGCGTTGAT 
GAACAGGGAAGAATACACCGCTGGCGTCATGGATCAGGTGCTGTCGACCAGTCGGGATCGCACCGCATCA 
ATTCTGCATTTCGTGGATACGATCACGGCGACTGATCGCGAGCTGCTTCTGGAGTATCTGCAACAGCAGG 
AGGGCAGGAA 



SEQ ID NO: 18 

AGACTTGGCGTCAAAGTTGAGTGGGACCTGTTGAAAACAGCTCGTATCTTCCCTGATTGGGTGGTTGAAA 
TTAGGGGTAAACCCGGATTTTTTCTCAAGTGAAGCGCTTTGACCTGTGTAAATTAAGAAAGTTGAGTCTA 
GTGGGAACAACTTTGTGGCATTTACCGTTGCCATATATGTAAGCTTGAGTCAGGCAGGCTCAATGAGGAG 
TTTTTCTTACCGGCGAAAGTCGGTGGAAGCAAGTCAAAGCTCAAGCCGTGGACAATACTAAAATCACCTA 
AAACAGGAGGCACCATT 



SEQ ID NO: 19 

CGGGAGCCGTTCTTCTCCACGATGGGGCGCTCGGTGAAATCACTGAAACGCGCGGTTTCCAGAAGCATCA 
CCACGATGGCGTCACTGACCTCAAAAGATGCAGTGTGCTCATCACTGAAGACGGTGTTTTCTTTGAAGCC 
AAGACCTGCATAAAAAACGCTTAGATGCTGCAAGGTCAGATACTGGTAGGTTGATGAAAATCATGTCGTG 
CTGAAGTCGTGCCATGGGAGTTCCACTTCCTCAAAGTGCCTTTTGGCTAATGTGACCCCAACCAGTATGC 
TGCCGGTTTTTGGATTAGGTTTGGCCATCGTGCTAGCATATTTAGGTCTCGGCGAGGGTCAAGTACTTTT 
AGTGCTCAACCGTTATCGACGCGATCTAGACTTCTAAAGTGCACTTTTGTGCGCTGCCTGCGAAGACTCG 
ACCAAGACATTCGAGTCGGTCGCGGGCATTTTTTATTTTCGCGGCCGAGTGTCCACCTTCATCCATGAGG 
AGAAATCACT 



SEQIDNO:20 

TGGATTGTCTGGCGGACAAAGTTCGCACCGATAAACCCGGCACCACCGGTCACAAGCAAAGAAGTCATGG 
TCGTCATAGTAGTTGCGGCTCCTGCAGTTGCGCGTGTTATCGCTGACGGGCATGTCAATTCAGCCGTGAC 
CCAGCTTCGGTGCGGTGTGAAAACGTATGGGGAAAAGCCACGGATAGTGCATGCCCACTTCTCCACATGG 
GCTGTGCTCAGCTGTCGTAGGGGCCACGGCTTGCGGACTTTCTGAACACCCGGACGAAAGCCTCCATTAT 
TGATGGCACCCACCCGGGGGTGGTATCGGTTGCTAAGTTCACAGTGGTCGGAGAGAATAAGTCAGCACTT 
CTGATTACATGTACGAAATCAGGGCGGTCAAATGCCAGTTCCCACCCGGCTTCACTAAGGCGGGACGACC 
ATAGAAACGCAGCTTCTGACATTGAATGCGCCGGTTTGCTATGGACTACACCGCATATGAGGAAAGGGCT 
TGAAACGCAC 



SEQ IDNO:21 
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CTTAATGAGTACGGCGTGGCTTATATCGGTTTCGCTCCGTTTGAAATCGGCATGACCATCATCGCGCCGA 
TCGCGGTGCTCGCAGGCTTTACTATGGGGTTGGGTTGGGCGTCGTTGATTGTTGCGATCGTGATTTTTGG 
CCTCGCGTGGGGTCTGAAGTGGTTGCCGGAGCGCGGACATGTCCGCGGCGAGGGTAAGCCGCAATAAAGG 
TTGGAAGCGCCGGGTCTAGGTCCGGCGCTTCTTTCGTACGCTTTTCGACGCCTCCCTCCACGTAATATTA 
AAGTTACGGGTTTTCCCTGATGCTTAAGTGGTAGTCAGTGCTTAAACTTGACTGCGGTCCACTCAATTTA 
TTTTCAAATTTTTTGAACTTGAGTGGAACATACTCAACTCTTTGTGCGTTATAGATATTAGAGAGTTAAA 
TAATGGCGCTTGACCTGCAGGAAATTGAGATCAACACTGATTGTGTAGGTTGGCGCCCAACAAAGAAAGG 
GCGTTGAAAG 



SEQ ID NO:22 

TGGCCATTCCTGTGGTCCATGCTCGCCCTGTTCTTCTTCACTGGACTGGGCAACGCCGGCACATTCAAAC 
AAATGCCCATGATTTTGCCCAAACGCCAAGCAGGTGGCGTGATCGGCTGGACCGGTGCCATTGGTGCCTT 
CGGCCCCTTCATTGTCGGTGTCTTGCTCTCCTTCACTCCAACTGTCGCGTTCTTCTGGGGCTGCGTGGTG 
TTCTTCATCATCGCCACCGCTTTGACCTGGATCTACTACGCCCGCCCGAACGCTCCATTCCCGGGATAAA 
CCGAAAGGCCAATCCATGACTACAACTACTTCTTCTGGGAAGTCTTCTGAACAGTCTTCTGAAAAGATCA 
ACCCCCTCTTCAAGCTCGGCAGTTTCCTAAGAAAAGGCACCGT CGGTTCTGAAGGCCAGCAGATTTTCCT 
TCAGGGCGGACGCCAAGCCGATGTGTTTTTATCGCAACCCGATGGGCGTTCGATAAAAGT CGTGCGCTCC 
C AC AC AT G G C 



[0079] In certain embodiments, polynucleotides of the invention comprise a first 

nucleic acid, the sequence of which is at least 90%, 91%, 92%, 93%, 94%, 95%, 
96%, 97%o, 98% or 99% identical to a sequence selected from the group 
consisting of SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ 
IDNO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO: 1 1, SEQ ID NO: 12, SEQ 
ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, 
SEQ IDNO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ IDNO:21, and SEQ ID 
NO:22, or a complementary sequence thereof. Of course, this embodiment also 
encompasses a first nucleic acid, the sequence of which is identical to any of the 
recited nucleic acid sequences. 

[0080] By a polynucleotide comprising a first nucleic acid, the sequence of which 

is at least, for example, 95% "identical" to a reference nucleotide sequence is 
intended that the first nucleic acid sequence is identical to the reference sequence 
except that the first nucleic acid sequence may include up to five mismatches per 
each 100 nucleotides of the reference nucleic acid sequence. In other words, to 
obtain a first nucleic acid, the sequence of which is at least 95% identical to a 
reference nucleic acid sequence, up to 5% of the nucleotides in the reference 
sequence may be deleted or substituted with another nucleotide, or a number of 
nucleotides up to 5% of the total nucleotides in the reference sequence may be 
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inserted into the reference sequence. The reference (query) sequence may be any 
one of the entire nucleotide sequences shown in SEQ ID NO:4, SEQ ID NO: 5, 
SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, 
SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID 
NO:15, SEQIDNO:16, SEQ ID NO:17, SEQIDNO:18, SEQ ID NO:19, SEQ 
ID NO:20, SEQ ID NO:21, or SEQ ID NO:22, or any fragment of any of these 
sequences, as described infra. 

As a practical matter, whether any particular nucleic acid sequence is at 
least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identical to, for 
instance, a nucleotide sequence selected from the group consisting of SEQ ID 
NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID 
NO:9, SEQ ID NO:10, SEQ ID NO:l 1, SEQ ID NO:12, SEQ ID NO:13, SEQ 
ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, 
SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, and SEQ ID NO:22, or a 
complementary sequence thereof, can be determined conventionally using 
sequence analysis computer programs such as a OMIGA® Version 2.0 for 
Windows, available from Oxford Molecular, Ltd. (Oxford, U.K.). OMIGA uses 
the CLUSTAL W alignment algorithm using the slow full dynamic programming 
alignment method with default parameters of an open gap penalty of 10 and an 
extend gap penalty of 5.0, to find the best alignment between two nucleotide 
sequences. When using CLUSTAL W or any other sequence alignment program 
to determine whether a particular sequence is, for instance, 95% identical to a 
reference sequence according to the present invention, the parameters are set, of 
course, such that the percentage of identity is calculated over the full length of the 
reference nucleotide sequence such that gaps, mismatches, or insertions of up to 
5% of the total number of nucleotides in the reference sequence are allowed. 

This embodiment of the present invention is directed to polynucleotides 
comprising a first nucleic acid, the sequence of which is at least 90%, 91%, 92%, 
93%, 94%, 95%, 96%, 97%, 98% or 99% identical to a nucleic acid sequence 
selected from the group consisting of SEQ ID NO:4, SEQ ID NO:5, SEQ ID 
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NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID 
NO:ll, SEQ IDNO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ 
ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, 
SEQ ID NO:21, and SEQ ID NO:22, or a complementary sequence thereof, 
irrespective of whether they have functional activity. This is because even where 
a particular polynucleotide does not have functional activity, one of skill in the 
art would still know how to use the nucleic acid molecule, for instance, as a 
hybridization probe, an SI nuclease mapping probe, or a polymerase chain 
reaction (PCR) primer. 

Preferred, however, are polynucleotides comprising a nucleic acid, the 
sequence of which is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% 
or 99% identical to a nucleic acid sequence selected from the group consisting of 
SEQ ID NO:4, SEQ IDNO:5, SEQ ID NO:6, SEQ IDNO:7, SEQ ID NO:8, SEQ 
ID NO: 9, SEQ ID NO: 1 0, SEQ ID NO: 1 1 , SEQ ID NO: 1 2, SEQ ID NO: 1 3 , SEQ 
ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, 
SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, and SEQ ID NO:22, or a 
complementary sequence thereof, which do, in fact, have functional activity, in 
particular, polynucleotides which are capable of regulating transcription in 
Corynebacterium species. Examples of polynucleotides which regulate 
transcription include promoters, operators, transcription factor recognitions sites, 
transcriptional complexes, and transcriptional regulatory elements, as described 
herein. Preferably, polynucleotides of the present invention comprise a promoter 
which functions in Corynebacterium species, preferably in Corynebacterium 
glutamicum, Brevibacterium flavum, or Brevibacterium lactofermentum, even 
more preferably in C. glutamicum. Assays to determine whether a polynucleotide 
is capable of regulating transcription in Corynebacterium species can be routinely 
performed using techniques described herein and otherwise known in the art. 

Accordingly, the present invention also encompasses the above 
polynucleotide, further comprising a second nucleic acid. Preferably, the second 
nucleic acid encodes one or more polypeptides, and the physical location of the 
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first nucleic acid relative to the second nucleic acid is such that, under the 
appropriate conditions, the first nucleic acid will operably regulate transcription 
of the second nucleic acid, thereby facilitating production of the one or more 
polypeptides. "Facilitation of production" includes increasing production 
constitutively, decreasing or blocking production except under specific 
conditions, and/or increasing production except under specific conditions. 
Among the "conditions" contemplated by the present invention are: (1) adding a 
component to a culture medium, (2) removing a component from a culture 
medium, (3) replacing one component of a culture medium with a second 
component, (4) increasing the temperature of the culture medium, (5) decreasing 
the temperature of the culture medium, and (6) regulating the atmospheric 
conditions (e.g. , oxygen or nitrogen concentrations) in which the culture medium 
is maintained. Examples of such conditions are described in more detail, infra. 

[0085] One particularly preferred aspect of this embodiment is a polynucleotide 

comprising a nucleic acid, the sequence of which is at least 90%, 91%, 92%, 
93%, 94%, 95%, 96%, 97%, 98% or 99% identical to a nucleic acid sequence 
selected from the group consisting of SEQ ID NO:l, SEQ ID NO:2, SEQ ID 
NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID 
NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO: 1 1, SEQ ID NO: 12, SEQ ID 
NO:13, SEQ ID NO:14, SEQIDNO:15, SEQIDNO:16, SEQIDNO:17, SEQ 
ID NO: 1 8, SEQ ID NO: 1 9, SEQ ID NO:20, SEQ ID NO:21 , and SEQ ID NO:22; 
and a second nucleic acid, which encodes a component of an amino acid 
biosynthesis pathway. Examples of amino acid biosynthesis pathways included 
in this aspect of the invention are the enzymatic pathways that result in the 
synthesis of L-glycine, L-alanine, L-leucine, L-methionine, L-phenylalanine, L- 
tryptophan, L-lysine, L-glutamine, L-glutamic acid, L-serine, L-proline, L- valine, 
L-isoleucine, L-cysteine, L-tyrosine, L-histidine, L-arginine, L-asparagine, L- 
aspartic acid, and L-threonine. 

[0086] A preferred amino acid biosynthesis pathway results in the synthesis of 

L-lysine. In this preferred aspect of this embodiment, the second nucleic acid 
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encodes one of the following components of the L-lysine biosynthesis pathway: 
aspartokinase, aspartate beta-semialdehyde dehydrogenase, diaminopimelate 
dehydrogenase, diaminopimelate decarboxylase, dihydrodipicolinate synthetase, 
dihydrodipicolinate reductase, or pyruvate carboxylate. 

The present invention is further directed to a polynucleotide comprising 
a first nucleic acid, the sequence of which comprises about 6 contiguous 
nucleotides, preferably about 10 contiguous nucleotides, even more preferably 
about 12, 15, 20, 30, 40, 50, 100, 150, 200, or in some cases up to about 500, 
1000, or 1500 contiguous nucleotides, of a sequence selected from the group 
consisting of SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ 
IDNO:8, SEQIDNO:9, SEQ ID NO: 10, SEQ ID NO: 11, SEQIDNO:12, SEQ 
ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, 
SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO:20, SEQ ID NO:21, and SEQ ID 
NO:22, or the complement of any of these sequences. By a first nucleic acid the 
sequence of which comprises about 1 0 contiguous nucleotides of any of said 
nucleic acid sequences, for example, is intended a nucleic acid which includes 
about 1 0 contiguous bases from any of said nucleotide sequences. Of course, the 
polynucleotide includes at least 10 contiguous nucleotides of any of said nucleic 
acid sequences, and may include about 12, 15, 20, 30, 40, 50, 100, 150, 200, or 
in some cases up to about 500, 1 000, or 1 500 contiguous nucleotides, or the entire 
sequence, of any of said nucleic acid sequences. In this context "about" includes 
the particularly recited size, larger or smaller by several (5, 4, 3, 2, or 1) 
nucleotides, at either terminus or at both termini. 

Representative examples of polynucleotides of the invention include a 
first nucleic acid the sequence of which comprises, for example, a sequence from 
about nucleotide 1-50, 51-100, 101-150, 151-200, 201-250, 251-300, 301-350, 
351-400, 401-450, or 450-500 of SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, 
SEQ IDNO:7, SEQ IDNO:8, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 13, 
SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID 
NO: 19, SEQ ID NO:20, SEQ ID NO:21, or SEQ ID NO:22, or the 
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complementary DNA strand thereto; a sequence from about nucleotide 1-50,51- 
100,101-150,151-200,201-250,251-300,301-350,351-400,401-450,450-500, 
501-550, 551-600, 601-650, 651-700, 701-750, 75 1-800, 801-850, 851-900, 901- 
950,951-1000,1001-1050,1051-1100,1101-1150,1151-1200,1201-1250,1251- 
1300, 1301-1350, 1351-1400, 1401-1450, 1451-1500 or 1501-1583 of SEQ ID 
NO:9, or the complementary DNA strand thereto; a sequence from about 
nucleotide 1-50, 51-100, 101-150, 151-200,201-250,251-300, 301-350, 351-400, 
401-450,450-500,501-550,551-600,601-650,651-700, 701-750,751-800,801- 
850, 851-900, 901-950, 951-1000, or 1001-1083 of SEQ ID NO:12, or the 
complementary DNA strand thereto; or a sequence from about nucleotide 1-50, 
51-100, 101-150, 151-200, 201-250, or 251-297 of SEQ ID NO:18, or the 
complementary DNA strand thereto. In this context "about" includes the 
particularly recited ranges, larger or smaller by several (5, 4, 3, 2, or 1) 
nucleotides, at either terminus or at both termini. Additional examples of 
polynucleotides of the invention include a first nucleic acid the sequence of which 
comprises a consensus -10 or -35 region as listed in Table 2, infra. 
[0089] This embodiment of the present invention is directed to a polynucleotide 

comprising a first nucleic acid, the sequence of which comprises about 6 
contiguous nucleotides, preferably aboutlO contiguous nucleotides, even more 
preferably about 12, 15, 20, 30, 40, 50, 100, 150, 200, or in some cases up to 
about 500, 1000, or 1 500 contiguous nucleotides, of a sequence selected from the 
group consisting of SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, 
SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO: 1 1, SEQ ID NO: 12, 
SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID 
NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO:20, SEQ ID NO:21, and 
SEQ ID NO:22, or the complement of any of these sequences, irrespective of 
whether they have functional activity. This is because even where a particular 
polynucleotide does not have functional activity, one of skill in the art would still 
know how to use the nucleic acid molecule, for instance, as a hybridization probe, 
an SI nuclease mapping probe, or a polymerase chain reaction (PCR) primer. 
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[0090] Preferred, however, a polynucleotide comprising a first nucleic acid, the 

sequence of which comprises about 6 contiguous nucleotides, preferably about 1 0 
contiguous nucleotides, even more preferably about 12, 15, 20, 30, 40, 50, 100, 
1 50, 200, or in some cases up to about 500, 1000, or 1 500 contiguous nucleotides, 
of a sequence selected from the group consisting of SEQ ID NO:4, SEQ ID NO:5, 
SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, 
SEQ ID NO:ll, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID 
NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ 
ID NO:20, SEQ ID NO:21, and SEQ ID NO:22, or the complement of any of 
these sequences, which do, in fact, have functional activity, in particular, 
polynucleotides which regulate transcription in Corynebacterium species. 
Examples of polynucleotides which regulate transcription in Corynebacterium 
species include promoters, operators, transcription factor recognitions sites, 
transcriptional complexes, and transcriptional regulatory elements, as described 
herein. 

[009 1] Preferably, polynucleotides of the present invention comprise a promoter 

which functions in Corynebacterium species, preferably in Corynebacterium 
glutamicum, Brevibacterium flavum, or Brevibacterium lactofermentum, even 
more preferably in C. glutamicum. Assays to determine whether a 
polynucleotide is capable of regulating transcription in Corynebacterium species 
can be routinely performed using techniques described herein and otherwise 
known in the art. 

[0092] Accordingly, the present invention also encompasses the above 

polynucleotide, further comprising a second nucleic acid. Preferably, the second 
nucleic acid encodes one or more polypeptides, and the physical location of the 
first nucleic acid relative to the second nucleic acid is such that, under the 
appropriate conditions, the first nucleic acid will operably regulate transcription 
of the second nucleic acid, thereby facilitating production of the one or more 
polypeptides, as described herein. 
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[0093] One particularly preferred aspect of this embodiment is a polynucleotide 

comprising a first nucleic acid, the sequence of which comprises about 6 
contiguous nucleotides, preferably about 10 contiguous nucleotides, even more 
preferably about 12, 15, 20, 30, 40, 50, 100, 150, 200, or in some cases up to 
about 500, 1000, or 1500 contiguous nucleotides, of a nucleic acid sequence 
selected from the group consisting of SEQ ID NO:l, SEQ ID NO:2, SEQ ID 
NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID 
NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ IDNO:ll, SEQ ID NO: 12, SEQ ID 
NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ 
ID NO: 1 8, SEQ ID NO: 1 9, SEQ ID NO:20, SEQ ID NO:2 1 , and SEQ ID NO:22; 
and a second nucleic acid, which encodes a component of an amino acid 
biosynthesis pathway. A preferred amino acid biosynthesis pathway results in the 
synthesis of L-lysine. Examples of amino acid biosynthesis pathways, and 
specific components of the L-lysine biosynthesis pathway are described herein. 

[0094] The nucleotide sequences of functional bacterial promoters exhibit a 

limited but significant degree of variability depending on the gene that is 
regulated by the particular promoters in question, and the bacterial species in 
which the promoter is found. See, e.g., Patek et ah, Microbiology 142:1297 
(1996), which is incorporated herein by reference in its entirety. Despite this 
variability, investigators have identified promoter consensus sequences by 
comparing the nucleotide sequences of multiple bacterial promoters to one 
another. A consensus sequence is a sequence that reflects the most common 
nucleotides that are found in particular positions for a multitude of promoters. 
Frequently, the functionality of a promoter consensus sequence is confirmed by 
mutational analysis in an experimental system; i.e., the expression of a reporter 
gene that is under the control of a mutated promoter is assessed. 

[0095] Early experiments and analyses of E. coli promoters identified a 

consensus sequence that comprises two nucleotide hexamers (six contiguous 
nucleotides) separated by about 17 nucleotides, and located about 35 and 10 
nucleotides, respectively, upstream of the transcriptional start site for the 
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corresponding gene. Patek et al., Microbiology \A2:\291 (1996). These two 
hexamers are known individually as the -35 and -10 regions, and their respective 
consensus sequences are TTGaca, and TAtaaT. Within a consensus sequence, 
capital letters denote particular nucleotides that are found in analogous positions 
in at least 70% of the promoter sequences analyzed. Lower case letters denote 
nucleotides that are found in analogous positions in between 42% to 70% of the 
promoter sequences analyzed. 
[0096] Recently, Patek et al. analyzed the sequences of 18 promoters from C. 

glutamicum, and four promoters from the temperate corynephage #>GA1, to 
identify a promoter consensus sequence specific to C. glutamicum. From their 
analysis, Patek et al. identified a -35 consensus sequence of ttGcca, and a -10 
consensus sequence of TA.aaT. The period (".") at the third position of the -10 
consensus sequence indicates that the nucleotide at this position does not 
establish a consensus among the C. glutamicum promoters that were compared 
by Patek et al. 

[0097] Accordingly, in one aspect of this embodiment of the present invention, 

a polynucleotide is provided which comprises a -10 promoter consensus 
sequence, and/or a -35 consensus sequence derived from a sequence selected 
from the group consisting of SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ 
ID NO:7, SEQ ID NO: 8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO:l 1, SEQ 
ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, 
SEQ ID NO:17, SEQ ID NO: 18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID 
NO:21, and SEQ ID NO:22. While not being bound by theory, such sequence 
motifs may function as part of a promoter capable of regulating transcription in 
Corynebacterium species, preferably in Corynebacterium glutamicum, 
Brevibacterium flavum, oxBrevibacterium lactofermentum, even more preferably 
in C. glutamicum. 

[0098] To identify -10 promoter and/or -35 promoter consensus sequences, 

computer analyses of SEQ ID NOs: 1 through 22 were conducted using the 
prokaryotic nucleic acid motif search functionality provided by the OMIGA 2.0 
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computer software package. In addition, SEQ ID NOs: 1 through 22 were 
analyzed visually to identify any sequences that agree with the consensus 
sequences identified by Patek et al. A summary of predicted -10 and -35 
consensus sequences identified for SEQ ID NO: 1 through 22 is listed in Table 2. 
It is to be noted that in the numbering convention of SEQ ID NOs 1 through 22, 
all nucleotides have a positive number and the numbers increase. This is in 
contrast to the normal convention for promoter regions, wherein the nucleotides 
are assigned negative numbers which increase up to nucleotide number 1 at the 
putative transcription start site. 
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TABLE 2 



SEQIDNO: 


-10 sequence(s) 


Position 


-35 sequence(s) 


Position 


1 






TTGAGA* 


375 


1 






TTGAAA* 


386 


3 






TTGAAA* 


339 


4 


TACAAT* 


275 






4 






TTGAGA* 


57 


4 






TTGAAA* 


186 


4 






TTGAAA* 


301 


4 






TTGCCA J 


466 


5 






TTGAAA* 


307 


5 






TTGAGA* 


340 


7 


TACAAT* 


334 






7 - 






TTGCCA* 


224 


8 






TTGATA* 


341 


8 






TTGCCA* 


371 


9 






TTGATA* 


803 


10 


TACAAT* 


406 






10 






TTGACA* 


234 


11 






TTGAAA* 


248 


13 


TACAAT* 


394 






13 






TTGCCA* 


118 


14 


TAAAAT* 


169 






15 


TAAAAT* 


379 






17 






TTGACA* 


211 


18 






TTGAAA* 


31 


18 






TTGAAA* 


65 


18 


TAAAAT* 


269 






18 






TTGCCA* 


168 


20 






TTGAAA* 


490 


21 






TTGAAA* 


41 


21 






TTGAGA* 


445 


21 






TTGAAA* 


494 



* consensus sequences identified by the OMIGA 2.0 program 

J consensus sequences identified visually by comparison to consensus sequences set 
forth in Patek et al. 



-41- 



In another embodiment, the invention provides an isolated polynucleotide 
comprising a first nucleic acid at least 1 0 nucleotides in length, preferably at least 
12, 15, 30, 50 or 150 nucleotides in length, which hybridizes under stringent 
conditions to a reference nucleic acid, or the complement thereof, wherein the 
sequence of said reference nucleic acid is selected from the group consisting of 
SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ 
ID NO:9, SEQ ID NO: 10, SEQ ID NO: 1 1, SEQ IDNO:12, SEQ ID NO: 1 3, SEQ 
ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, 
SEQ ID NO: 19, SEQ ID NO:20, SEQ ID NO:21, and SEQ ID NO:22. By 
"stringent hybridization conditions" is intended overnight incubation at 42 °C in 
a solution comprising: 50% formamide, 5x SSC (750 mMNaCl, 75mM trisodium 
citrate), 50 mM sodium phosphate (pH 7.6), 5x Denhardt's solution, 1 0% dextran 
sulfate, and 20 [ig/ml denatured, sheared salmon sperm DNA, followed by 
washing the filters in 0. lx SSC at about 65 °C. 
| By a first nucleic acid "at least 1 0 nucleotides in length," which hybridizes 

to a reference nucleic acid is intended a first nucleic acid (either DNA or RNA) 
which hybridizes to 10 or more contiguous nucleotides from the nucleotide 
sequence of the reference nucleic acid {e.g., SEQ ID NOs 4-22). 
] This embodiment of the present invention is directed to an isolated 

polynucleotide comprising a first nucleic acid at least 10 nucleotides in length, 
preferably at least 12, 15, 30, 50 or 150 nucleotides in length, which hybridizes 
under stringent conditions to a reference nucleic acid, or the complement thereof, 
wherein the sequence of said reference nucleic acid is selected from the group 
consisting of SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ 
IDNO:8,SEQIDNO:9,SEQIDNO:10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ 
ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, 
SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO:20, SEQ ID NO:21, and SEQ ID 
NO:22, or the complement of any of these sequences, irrespective of whether they 
have functional activity. This is because even where a particular polynucleotide 
does not have functional activity, one of skill in the art would still know how to 
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use the nucleic acid molecule, for instance, as a hybridization probe, an SI 
nuclease mapping probe, or a polymerase chain reaction (PCR) primer. 

Preferred, however, an isolated polynucleotide comprising a first nucleic 
acid at least 10 nucleotides in length, preferably at least 12, 15, 30, 50 or 150 
nucleotides in length, which hybridizes under stringent conditions to a reference 
nucleic acid, or the complement thereof, wherein the sequence of said reference 
nucleic acid is selected from the group consisting of SEQ ID NO: 4, SEQ ID 
NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID 
NO: 1 0, SEQ ID NO : 1 1 , SEQ ID NO: 1 2, SEQ ID NO: 1 3 , SEQ ID NO: 1 4, SEQ 
ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO: 18, SEQ ID NO:19, 
SEQ ID NO:20, SEQ ID NO:21, and SEQ ID NO:22, or the complement of any 
of these sequences, which do, in fact, have functional activity, in particular, 
polynucleotides which regulate transcription in Corynebacterium species. 
Examples of polynucleotides which regulate transcription in Corynebacterium 
species include promoters, operators, transcription factor recognitions sites, 
transcriptional complexes, and transcriptional regulatory elements, as described 
herein. Preferably, polynucleotides of the present invention comprise a promoter 
which functions in Corynebacterium species, preferably in Corynebacterium 
glutamicum, Brevibacterium flavum, or Brevibacterium lactofermentum, even 
more preferably in C. glutamicum. Assays to determine whether a 
polynucleotide is capable of regulating transcription in Corynebacterium species 
can be routinely performed using techniques described herein and otherwise 
known in the art. 

] Accordingly, the present invention also encompasses the above 

polynucleotide, further comprising a second nucleic acid. Preferably, the second 
nucleic acid encodes one or more polypeptides, and the physical location of the 
first nucleic acid relative to the second nucleic acid is such that, under the 
appropriate conditions, the first nucleic acid will operably regulate transcription 
of the second nucleic acid, thereby facilitating production of the one or more 
polypeptides, as described herein. 
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One particularly preferred aspect of this embodiment is an isolated 
polynucleotide comprising a first nucleic acid at least 10 nucleotides in length, 
preferably at least 12, 15, 30, 50 or 150 nucleotides in length, which hybridizes 
under stringent conditions to a reference nucleic acid, or the complement thereof, 
wherein the sequence of said reference nucleic acid is selected from the group 
consisting of selected from the group consisting of SEQ ID NO: 1 , SEQ ID NO:2, 
SEQ IDNO:3, SEQ IDNO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ IDNO:7, SEQ 
ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:l 1, SEQ ID NO:12, SEQ 
ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, 
SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO:20, SEQ ID NO:21, and SEQ ID 
NO:22; and a second nucleic acid, which encodes a component of an amino acid 
biosynthesis pathway. A preferred amino acid biosynthesis pathway results in the 
synthesis of L-lysine. Examples of amino acid biosynthesis pathways, and 
specific components of the L-lysine biosynthesis pathway are disclosed herein. 

Another embodiment of the invention provides an isolated polynucleotide 
comprising a first nucleic acid, the sequence of which is at least 90%, 91%, 92%, 
93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 2, and a 
second nucleic acid which encodes a polypeptide which functions as a component 
of the lysine biosynthesis pathway. Alternatively in this embodiment, the 
sequence of the first nucleic acid comprises about 6 contiguous nucleotides, 
preferably aboutlO contiguous nucleotides, even more preferably about 12, 15, 
20, 30, 40, 50, 100, 150, 200, or 500, contiguous nucleotides, of SEQ ID NO:2, 
or the first nucleic acid is at least 10 nucleotides in length, preferably at least 12, 
1 5, 30, 50 or 1 50 nucleotides in length, and hybridizes under stringent conditions 
to a reference nucleic acid, or the complement thereof, wherein the sequence of 
said reference nucleic acid is SEQ ID NO:2. A preferred amino acid biosynthesis 
pathway results in the synthesis of L-lysine. Examples of components of the L- 
lysine biosynthesis pathway are disclosed herein. In this embodiment, the first 
nucleic acid, which comprises the C. glutamicum aceA promoter, regulates 
transcription of the second nucleic acid. 
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Yet another embodiment of the invention provides an isolated 
polynucleotide comprising a first nucleic acid, the sequence of which is at least 
90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID 
NO: 3; and a second nucleic acid which encodes polypeptide which functions as 
a component of an amino acid biosynthesis pathway. Alternatively in this 
embodiment, the sequence of the first nucleic acid comprises about 6 contiguous 
nucleotides, preferably about 10 contiguous nucleotides, even more preferably 
about 12, 15, 20, 30, 40, 50, 100, 150, 200, or 500, contiguous nucleotides, of 
SEQ ID NO: 3, or the first nucleic acid is at least 10 nucleotides in length, 
preferably at least 12, 15, 30, 50 or 150 nucleotides in length, and hybridizes 
under stringent conditions to a reference nucleic acid, or the complement thereof, 
wherein the sequence of said reference nucleic acid is SEQ ID NO:3. A 
preferred amino acid biosynthesis pathway results in the synthesis of L-lysine. 
Examples of amino acid biosynthesis pathways, and specific components of the 
L-lysine biosynthesis pathway are disclosed herein. In this embodiment, the first 
nucleic acid, which comprises the C glutamicum aceB promoter, regulates 
transcription of the second nucleic acid. 

A further embodiment of the invention provides an isolated 
Corynebacterium chromosome with a first nucleic acid integrated into the 
chromosome, the sequence of which is at least 90%, 91%, 92%, 93%, 94%, 95%, 
96%, 97%, 98%, or 99% identical to SEQ ID NO:l. By "an isolated 
Corynebacterium species chromosome" is meant a bacterial chromosome which 
has been altered in some way to affect gene expression, either of naturally- 
occuring genes or heterologous genes, and then isolated from other non-altered 
chromosomes, e.g., by selection and isolation of a host cell harboring the 
chromosome. In this embodiment, the isolated Corynebacterium species 
chromosome also has a second nucleic acid integrated therein, wherein said 
second nucleic acid encodes polypeptide which functions as a component of an 
amino acid biosynthesis pathway. A preferred amino acid biosynthesis pathway 
results in the synthesis of L-lysine. Examples of amino acid biosynthesis 
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pathways, and specific components of the L-lysine biosynthesis pathway are 
disclosed herein. In this embodiment, the first nucleic acid, which comprises the 
C. glutamicum pta promoter, regulates transcription of the second nucleic acid. 

Another embodiment of the invention provides an isolated 
Corynebacterium chromosome with a first nucleic acid integrated into the 
chromosome, the sequence of which is at least 90%, 91%, 92%, 93%, 94%, 95%, 
96%, 97%, 98%, or 99% identical to SEQ ID NO:23 . In this embodiment, the 
isolated Corynebacterium species chromosome also has a second nucleic acid 
integrated therein, wherein said second nucleic acid encodes polypeptide which 
functions as a component of an amino acid biosynthesis pathway, preferably an 
L-lysine biosynthesis pathway. In this embodiment, the first nucleic acid, which 
comprises the E. coli araBAD promoter and the araC regulator molecule, 
regulates transcription of the second nucleic acid. The araC/ araBAD 
transcriptional complex is a tightly regulated system responsive to the regulator 
molecule arabinose. 

Yet another embodiment of the invention provides an isolated 
Corynebacterium chromosome with a first nucleic acid integrated into the 
chromosome, the sequence of which is identical to either SEQ ID NO:26 or SEQ 
ID NO:27. As with the embodiments described above, the isolated 
Corynebacterium species chromosome also has a second nucleic acid integrated 
therein, wherein said second nucleic acid encodes polypeptide which functions 
as a component of an amino acid biosynthesis pathway, preferably an L-lysine 
biosynthesis pathway, and the first nucleic acid, which is a trc or tac synthetic 
promoter, regulates transcription of the second nucleic acid. These synthetic 
promoters combine the -3 5 region from the trp promoter with the - 1 0 region from 
the ZacUV5 promoter. See Brosius, J. et al, J. Biol. Chem. 260, 3539 (1985). 
In the embodiment, the isolated chromosome further has a third nucleic acid 
integrated therein, the sequence of which is at least 90%, 91%, 92%, 93%, 94%, 
95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO:28. This third nucleic 
acid is operably linked to a transcription control region, and the nucleic acid 
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encodes a polypeptide at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 
98%, or 99% identical to SEQ ID NO:29, the E. coli lac repressor gene. The 
\acl q -trc (SEQ ID NO:30) or the lacl q -tac transcriptional complex is responsive 
to the regulatory molecules isoproplythiogalactoside (IPTG) and lactose. This 
regulatory complex in Corynebacterium glutamicum, however, is leaky, i.e., 
transcription occurs in the absence of any added inducer. 

In each of the isolated chromosomes described above, the second nucleic 
acid may be a naturally occurring Corynebacterium nucleic acid, situated in its 
normal position on the chromosome, it may be a naturally occurring 
Corynebacterium nucleic acid which has been moved to a different position on 
the chromosome, or it may be a non-native nucleic acid. The first nucleic acid is 
engineered to be in a physical location which allows it to regulate transcription 
of the second nucleic acid either through in vitro cloning of a vector having both 
the first and second nucleic acids followed by homologous or random 
recombination into the Corynebacterium species chromosome, or it is inserted in 
front of a selected second nucleic acid already situated in the chromosome 
through homologous recombination. Such manipulations are readily understood 
by those of ordinary skill in the art. 

In another aspect of the above embodiments, a Corynebacterium species 
host cell comprising a Corynebacterium species chromosome as described above 
is provided, preferably a host cell derived from Corynebacterium glutamicum, 
Brevibacterium flavum, and Brevibacterium lactofermentum, and even more 
preferably derived from Corynebacterium glutamicum. In a related aspect, a 
method of producing such a Corynebacterium species host cell is provided, which 
comprises transforming Corynebacterium species cells with a vector comprising 
a first nucleic acid as described above, wherein said vector facilitates integration 
of the first nucleic acid into the chromosome of said Corynebacterium species 
cells, and selecting the host cell in certain preferred embodiments, the vector also 
comprises the second nucleic acid described above, and/or the third nucleic acid 
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described above, with the first nucleic acid being physically situated to regulate 
transcription of the second nucleic acid. These vectors are also provided. 

[0112] The present invention further relates to variants of the polynucleotides of 

the present invention. Variants may occur naturally, such as a natural allelic 
variant. By an "allelic variant" is intended one of several alternate forms of a 
gene occupying a given locus on a chromosome of an organism. Genes II, Lewin, 
B., ed., John Wiley & Sons, New York (1985). Non-naturally occurring variants 
may be produced using art-known mutagenesis techniques. 

[0113] Such variants include those produced by nucleotide substitutions, 

deletions or additions which may involve one or more nucleotides. Particularly 
preferred are variants which alter the level of activity of the polynucleotide 
relative to a non-variant polynucleotide. For example, variants which increase or 
decrease the strength of a promoter, or variants which alter or eliminate the need 
for an inducer. Methods to construct variant promoters and screen them for their 
relative strengths utilizing reporter genes are well known in the art. See, e.g., 
Vasicova, etal.,J. Bacteriol. 181 :61 88-6199 (1 999), which is incorporated herein 
by reference in its entirety. 

D. Vectors and Host Cells 

[0114] The present invention also relates to vectors which include 

polynucleotides of the present invention, host cells which are engineered with 
vectors of the invention and methods to increase the production of metabolic 
products such as amino acids purines, pyrimidines, and heterologous polypeptides 
through utilization of host cells of the invention. 

[0115] Host cells can be engineered to incorporate polynucleotides of the present 

invention. The polynucleotides may be introduced alone or with other 
polynucleotides. Such other polynucleotides may be introduced independently, 
co-introduced or introduced joined to the polynucleotides of the invention. 
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[0116] In accordance with this aspect of the invention the vector may be, for 

example, a plasmid vector or a bacteriophage vector. Such vectors may be 
introduced into Corynebacterium cells such that they are maintained and 
replication as an extrachromosomal element, or, alternatively, such that they 
integrate into the chromosome. Vectors are introduced into host cells by well 
known techniques for introducing DNA and RNA into prokaryotic cells, 
examples of which are described in the Examples, infra. 

[0117] Preferred vectors comprise a first nucleic acid which is capable of 

regulating transcription of a second nucleic acid which is inserted in operable 
association, preferably immediately downstream, of the first nucleic acid. The 
second nucleic acid preferably encodes a polypeptide such as an enzyme in an 
amino acid biosynthesis pathway. Especially preferred among vectors are those 
which allow convenient insertion of the second nucleic acid into the vector into 
the proper position to be regulated by the first nucleic acid, for example, through 
the use of a multiple cloning region. Appropriate trans-acting factors such as 
inducers either are supplied by the host cell, supplied by a complementing vector 
or supplied by the vector itself upon introduction into the host cell. 

[0118] A great variety of vectors can be used in the invention. Such vectors 

include chromosomal, episomal and virus-derived vectors e.g., vectors derived 
from bacterial plasmids and from bacteriophage, as well as vectors derived from 
combinations thereof, such as those derived from plasmid and bacteriophage 
genetic elements, such as cosmids and phagemids, all may be used in accordance 
with this aspect of the present invention. Generally, any vector suitable to 
maintain and propagate a polynucleotide in a bacterial host may be used in this 
regard. 

[0119] A large numbers of suitable vectors for use in bacteria are known, many 

of which are commercially available. Preferred prokaryotic vectors include 
plasmids such as those capable of replication in E. coli (such as, for example, 
pBR322, ColEl, pSClOl, pACYC 184, ttVX. Such plasmids are, for example, 
disclosed by Maniatis, T., et al, In: Molecular Cloning, A Laboratory Manual, 
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Cold Spring Harbor Press, Cold Spring Harbor, NY (1982)). The following 
vectors are provided by way of example: pET (Novagen), pQE70, pQE60, pQE-9 
(Qiagen), pBs, phagescript, psiX174, pBlueScript SK, pBsKS, pNH8a, pNH16a, 
pNH18a, pNH46a (Stratagene); pTrc99A, pKK223-3, pKK233-3, pDR540, 
pRIT5 (Pharmacia). 

[0120] Preferred vectors for the isolated polynucleotides of the invention include 

the pFC 1 to pFC7 novel family of combinatorial cloning vectors (Lonsdale, D .M., 
et al. , Plant Molecular Biology Reporter 13: 343-345 (1 995)), the pKl 84 vector 
(Jobling, M.G. and Homes, R.K., Nucleic Acid Research 18: 53 1 5-53 16 (1990)). 

[0121] Another group of preferred vectors are those that are capable of 

autonomous replication in Corynebacterium species. Such vectors are well 
known to those skilled in the art of amino acid production by way of microbial 
fermentation, examples of which include pSRl, pMF1014a and vectors derived 
therefrom. 

[0 122] The second nucleic acid encoding a polypeptide is operatively associated 

with the first nucleic acid as described above, which provides the appropriate 
transcription control sequence(s)), in particular, a promoter. In addition, the 
vector may contain sites for transcription initiation and termination, and, in the 
transcribed region, a ribosome binding site for translation. The second nucleic 
acid, which provides the coding region of the polypeptide to be regulated will 
include a translation initiating AUG or GUG at the beginning and a termination 
codon (UAA, UGA or UAG) appropriately positioned at the end of the coding 
region. 

[0123] In addition, and as described in more detail herein, the vector may contain 

control regions that regulate as well as engender expression. Generally, such 
regions will operate by controlling transcription, such as inducer or repressor 
binding sites and enhancers, among others. 

[0124] Vectors of the present invention generally will include a selectable 

marker. Such markers also may be suitable for amplification or the vectors may 
contain additional markers for this purpose. In this regard, vectors preferably 
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contain one or more selectable marker genes to provide a phenotypic trait for 
selection of transformed host cells. Such markers include, but are not limited to, 
an antibiotic resistance gene such as a chloramphenicol, ampicillin, or kanamycin 
resistance gene, or an autotrophic gene which allows the host cell to grow in the 
absence of a nutrient for which the host cell strain is normally autotrophic. 

If the vector is intended to be maintained in the host cell 
extrachromosomally, it will contain, in addition and origin of replication which 
will allow it to replicate in the Corynebacterium species host cell. Alternatively, 
if it is desired that the vector integrate into the Corynebacterium species 
chromosome, the vector is constructed such that it cannot replicate in 
Corynebacterium. For example, such a vector might be capable of propagation 
in another organism, for example, E. coli, but lack the proper origin of replication 
to be propagated in Corynebacterium. In another aspect of this embodiment, the 
vector is a shuttle vector which can replicate and be maintained in more than one 
host cell species, for example, such a shuttle vector might be capable of 
replication in a Corynebacterium host cell such as a C glutamicum host cell, and 
also in an E. coli host cell. 

A vector comprising a polynucleotide as described elsewhere herein may 
be introduced into an appropriate Corynebacterium species host cell using a 
variety of well known techniques, for example, transformation or electroporation. 
Appropriate culture mediums and conditions for culturing Corynebacterium host 
cells are known in the art. 

The invention provides methods for increasing amino acid production and 
processes for the production of an amino acid wherein increased biosynthetic 
pathway gene expression is accomplished through insertion of an isolated 
polynucleotide into the chromosome of a host cell. For example, insertion of 
isolated polynucleotides into the chromosome of Corynebacterium species may 
be done utilizing the pK184 plasmid described by Jobling, M. et al., Nucleic 
Acids Research 18(17): 5315-5316 (1990). Because these vectors lack a 
Corynebacterium species origin of replication and contains a selectable marker 
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such as kanamycin (kari), cells will only be capable of growing under selection 
if the vector has been inserted into the host cell chromosome by homologous 
recombination. 

[0128] The invention also provides methods for increasing amino acid production 

and processes for the production of an amino acid wherein increased biosynthetic 
pathway gene expression is accomplished through the introduction into a host cell 
of a self-replicating, extra-chromosomal vector, e.g., a plasmid, comprising an 
isolated nucleic acid molecule encoding an amino acid biosynthetic pathway gene 
or genes. Suitable plasmids for these embodiments include pSRl and other 
derivatives of pSRl (Archer, J. etal, J. Gen. Microbiol. 139: 1753-1759 (1993)). 

[0129] These vectors are listed solely by way of illustration of the many vectors 

available to those of skill in the art. Other vectors are described in the Examples, 
infra. Selection of appropriate vectors to transform into a host cell is a well 
known procedure and the requisite techniques for vector construction, 
introduction of the vector into the host and maintenance in the host are routine 
skills in the art. 

[0130] The present invention also relates to Corynebacterium species host cells 

comprising the above-described vector constructs described herein. The host cell 
can be any Corynebacterium species, preferably Corynebacterium glutamicum, 
Brevibacteriumflavum, or Brevibacterium lactofermentum, even more preferably 
a strain of C. glutamicum. Preferred strains of C. glutamicum to use as host cells 
of the invention include: NRRL-B11474, ATCC 21799, ATCC 21529, ATCC 
21543, andE12. 

[0131] Introduction of a vector into the host cell can be effected by 

transformation, electroporation, transduction, infection or other methods. Such 
methods are described in many standard laboratory manuals, such as Davis et al, 
Basic Methods in Molecular Biology (1986). 
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E. Methods and Processes of the Invention 

[0132] Various embodiments of the invention provide methods to increase the 

production of an amino acid and processes for the production of an amino acid 
from a Corynebacterium species host cell. Particularly preferred 
Corynebacterium species of the methods and processes of the invention include: 
Corynebacterium glutamicum, Brevibacterium flavum, Brevibacterium 
lactofermentum and other Cornynebacteria and Brevibacteria species known in 
the art. 

[0133] Amino acid biosynthetic pathway genes embodied by the methods and 

processes described herein include those for L-glycine, L-alanine, L-methionine, 
L-phenylalanine, L-tryptophan, L-proline, L-serine, L-threonine, L-cysteine, L- 
tyrosine, L-asparagine, L-glutamine, L-aspartic acid, L-glutamic acid, L-lysine, 
L-arginine, L-histidine, L-isoleucine, L-leucine, and L-valine biosynthesis. 
Particularly preferred embodiments are drawn to biosynthetic pathway genes for 
L-lysine (Sahm et al,Ann. N. Y. Acad. Set. 782: 25-39 (1996)), L-threonine, L- 
isoleucine, L-tryptophan, and L-valine. 

[0134] By way of example, the amino acid pathway for L-lysine biosynthesis is 

well known to skilled artisans of amino acid production in Corynebacterium 
species. Genes encoding the enzymes important for the conversion of L-aspartate 
to L-lysine include the ash asd, dapA, dapB, ddh, and lysA genes, which encode, 
respectively, aspartokinase, aspartate beta-semialdehyde dehydrogenase, 
dihydrodipicolinate synthetase, dihydrodipicolinate reductase, diaminopimelate 
dehydrogenase, diaminopimelate decarboxylase, and pyruvate carboxylase. Thus, 
the invention provides herein for exemplary purposes only, specific embodiments 
utilizing L-lysine biosynthetic pathway genes. Other embodiments drawn to the 
use of biosynthetic pathway genes for the synthesis of other amino acids are also 
encompassed by the invention described herein. 

[0135] The methods to increase the production of a biosynthetic product such as 

an amino acid and the processes for the production of the biosynthetic product of 
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the invention involve increasing the expression of at least one biosynthetic 
pathway gene. Preferred methods of increasing expression comprise using 
promoters of the invention. 

Accordingly, the present invention provides a first nucleic acid which 
preferably comprises a promoter. Preferred promoters include, but are not limited 
to the following types of promoters: a constitutive promoter, an inducible 
promoter, a derepressable promoter, a heat sensitive promoter, and a cold 
sensitive promoter. In this aspect of the invention, a constitutive promoter causes 
the second nucleic acid to be transcribed at a substantially constant rate; an 
inducible promoter will be non-functional or will function at a reduced level 
under certain standard culture conditions, but causes elevated transcription of the 
second nucleic acid when the transformed host cells are grown in the presence of 
an appropriate inducer substance; a derepressable promoter causes elevated 
transcription of the second nucleic acid when the transformed host cells are 
grown in the absence of an appropriate suppressor substance, but will be non- 
functional or will function at a reduced level when the suppressor substance is 
present; a heat sensitive promoter causes elevated transcription of the second 
nucleic acid when the transformed host cells are grown in medium the 
temperature of which is adj usted to a temperature that is greater than the optimum 
temperature for wild-type untransformed host cells; and a cold sensitive promoter 
causes elevated transcription of the second nucleic acid when said transformed 
host cells are grown in a medium the temperature of which is adjusted to a 
temperature that is less than the optimum temperature for a wild-type 
untransformed host cell. 
] Also included within this embodiment of the invention are methods 

wherein the transformed host cells are grown to a sufficient cell density according 
to methods well known in the art. Further included in this embodiment are 
methods for substantially purifying the biosynthetic product from said culture, 
wherein said purification methods are also well known in the art. 
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[0138] In certain embodiments of the present invention, an inducible promoter 

is used. When an inducible promoter is used, it is preferred that the transformed 
Corynebacterium species host cells are first allowed to proliferate to a sufficient 
cell density in the absence of the inducer, followed by addition of the inducer to 
the culture medium thereby stimulating production of said product. Examples of 
inducers include, but are not limited to one or more of the following substances: 
acetic acid, pyruvate, ethanol, fatty acids, cellulose subunits, starch subunits, 
triglycerides, or any of the following sugars: fructose, maltose, lactose, or 
arabinose. 

[0139] In other embodiments, a derepressable promoter is used. When a 

derepressable promoter is used, it is preferred that the transformed 
Corynebacterium species host cells are first allowed to proliferate to a sufficient 
cell density in the presence of the suppressor, followed by either removal of the 
suppressor from the culture, or replacement of the medium containing the 
suppressor with medium lacking the suppressor, thereby stimulating production 
of said product. Under certain conditions, the suppressor is removed from the 
medium by being metabolized by the host cell. Examples of said suppressors 
include, but are not limited to any of the following substances: a purine, a 
pyrimidine, oxygen, or a suppressor amino acid including histidine. 

[0140] In other embodiments, the promoter is self-inducing, i.e., the inducer 

molecule is produced by the host cell and builds up through the course of the 
growth, thereby changing gene expression as growth progresses. In this 
embodiment, the promoter is induced either gradually or at some point in the 
growth of the host cell culture, however this induction does not require the 
addition of exogenous compounds to the culture medium. 

[0141] In yet another embodiment, a heat sensitive promoter is used. When a 

heat sensitive promoter is used, it is preferred that the transformed host cells are 
first allowed to proliferate to a sufficient cell density in culture medium, the 
temperature of which is adjusted to the optimum growth temperature for 
untransformed host cells, followed by adjusting the temperature of said culture 
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medium to a temperature that is greater than the optimum growth temperature for 
untransformed host cells. 

[0142] In yet another embodiment, a cold sensitive promoter is used. When a 

cold sensitive promoter is used, it is preferred that the transformed host cells are 
first allowed to proliferate to a sufficient cell density in culture medium, the 
temperature of which is adjusted to the optimum growth temperature for 
untransformed host cells, followed by adjusting the temperature of said culture 
medium to a temperature that is less than the optimum growth temperature for 
untransformed host cells. 

[0143] In a related embodiment of the invention, a method is provided for the 

production of a product such as an amino acid, a purine nucleotide, or a 
heterologous polypeptide in a Corynebacterium species host cell utilizing a 
heterologous transcriptional complex. According to this embodiment, the host 
cell harbors the heterologous transcriptional complex integrated into the bacterial 
chromosome. Preferred heterologous transcriptional complexes include the E. 
coli araC-araBAD transcriptional complex (SEQ ID NO:23), and the lacl q -?rc or 
lad q -tac transcriptional complexes which combine the E. coli lacl q repressor gene 
(lacl q , SEQ ID NO:28) with either the trc promoter (SEQ ID NO:26 or SEQ ID 
NO:30) or the tac promoter (SEQ ID NO:27). These synthetic promoters 
combine the -35 region from the trp promoter with the -10 region from the 
/acUV5 promoter. See Brosius, J. et al, J. Biol. Chem. 260, 3539 (1985). 

[0144] The method utilizing the E. coli araC-araBAD transcriptional complex 

preferably further comprises culturing the host cell in or on a medium to which 
the inducer arabinose has been added, either throughout growth of the host cells, 
or at a point during growth where the host cell culture has reached an optimal 
density. The method utilizing either the lacl q -/rc or the lacF-tac transcriptional 
complex may further comprise culturing the host cell in or on a medium to which 
the inducer isoproplythiogalactoside (IPTG) or the inducer lactose has been 
added, either throughout growth of the host cells, or at a point during growth 
where the host cell culture has reached an optimal density. However, in C. 
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glutamicum, the lacTltrc system is leaky, therefore an inducer is not required for 
optimal growth. 

[0145] A preferred aspect of this embodiment provides a method for the 

production of an amino acid in a Corynebacterium host cell utilizing a 
heterologous transcriptional complex integrated into the host cell chromosome, 
where the heterologous transcriptional complex operably regulates production a 
polypeptide which functions in an amino acid biosynthesis pathway. In this 
aspect, the heterologous transcriptional complex, when integrated into the host 
cell chromosome, regulates transcription of one or more metabolic pathway 
enzymes, thereby resulting in elevated production of the amino acid by the 
transformed host cell. 

[0146] One aspect of this embodiment involves addition of an inducer to the 

culture medium thereby upregulating the heterologous transcriptional complex. 
In the method utilizing the E. coli araC-araBAD transcriptional complex 
arabinose is added to the medium. In the method utilizing the lad q -frr or the 
lacl q -tac transcriptional complex, IPTG and/or lactose is added to the culture 
medium. Preferably in this method, the host cells are grown to a sufficient cell 
density according to methods well known in the art prior to addition of the 
inducer molecule. 

[0147] For those embodiments of the invention drawn to a method to increase 

production of an amino acid, screening for increased production of an amino acid, 
for example L-lysine, may be determined by directly comparing the amount of L- 
lysine produced in culture by a standard Corynebacterium species strain to that 
of a Corynebacterium species host cell which has been transformed to comprise 
an isolated polynucleotide of the present invention. The level of production of 
the amino acid of choice may conveniently determined by the following formula 
to calculate the percent yield from dextrose: [(g amino acid/L / (g dextrose 
consumed/L)] *100. 

[0148] A variety of media known to those skilled in the art may be used to 

support cell growth for the production of an amino acid. Illustrative examples of 
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suitable carbon sources include, but are not limited to: carbohydrates, such as 
glucose, fructose, sucrose, starch hydrolysate, cellulose hydrolysate and molasses; 
organic acids, such as acetic acid, propionic acid, formic acid, malic acid, citric 
acid, and fumaric acid; and alcohols, such as glycerol. Illustrative examples of 
suitable nitrogen sources include, but are not limited to: ammonia, including 
ammonia gas and aqueous ammonia; ammonium salts of inorganic or organic 
acids, such as ammonium chloride, ammonium phosphate, ammonium sulfate and 
ammonium acetate; and other nitrogen-containing sources, including meat 
extract, peptone, corn steep liquor, casein hydrolysate, soybean cake hydrolysate, 
urea and yeast extract. 
[0149] A variety of fermentation techniques are known in the art which may be 

employed in processes of the invention drawn to the production of amino acids. 
Generally, amino acids may be commercially produced from the invention in 
fermentation processes such as the batch type or of the fed-batch type. In batch 
type fermentations, all nutrients are added at the beginning of the fermentation. 
In fed-batch or extended fed-batch type fermentations one or a number of 
nutrients are continuously supplied to the culture, right from the beginning of the 
fermentation or after the culture has reached a certain age, or when the nutrient(s) 
which are fed were exhausted from the culture fluid. A variant of the extended 
batch of fed-batch type fermentation is the repeated fed-batch or fill-and-draw 
fermentation, where part of the contents of the fermenter is removed at some 
time, for instance when the fermenter is full, while feeding of a nutrient is 
continued. In this way a fermentation can be extended for a longer time. 
[0 150] Another type of fermentation, the continuous fermentation or chemostat 

culture, uses continuous feeding of a complete medium, while culture fluid is 
continuously or semi-continuously withdrawn in such a way that the volume of 
the broth in the fermenter remains approximately constant. A continuous 
fermentation can in principle be maintained for an infinite time. 
[0151] In a batch fermentation an organism grows until one of the essential 

nutrients in the medium becomes exhausted, or until fermentation conditions 
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become unfavorable (e.g. the pH decreases to a value inhibitory for microbial 
growth). In fed-batch fermentations measures are normally taken to maintain 
favorable growth conditions, e.g. by using pH control, and exhaustion of one or 
more essential nutrients is prevented by feeding these nutrient(s) to the culture. 
The microorganism will continue to grow, at a growth rate dictated by the rate of 
nutrient feed. Generally a single nutrient, very often the carbon source, will 
become limiting for growth. The same principle applies for a continuous 
fermentation, usually one nutrient in the medium feed is limiting, all other 
nutrients are in excess. The limiting nutrient will be present in the culture fluid 
at a very low concentration, often unmeasurably low. Different types of nutrient 
limitation can be employed. Carbon source limitation is most often used. Other 
examples are limitation by the nitrogen source, limitation by oxygen, limitation 
by a specific nutrient such as a vitamin or an amino acid (in case the 
microorganism is auxotrophic for such a compound), limitation by sulphur and 
limitation by phosphorous. 
[0152] The amino acid may be recovered by any method known in the art. 

Exemplary procedures are provided in the following: Van Walsem, H.J. & 
Thompson, M.C.,J. Biotechnol. 59:127-1 32 (1997), and U.S. Pat. No. 3,565,951, 
both of which are incorporated herein by reference. 

EXAMPLES 

EXAMPLE 1 

Production of Host Cells Utilizing Transcriptional Regulatory Regions of the 
Present Invention 

[0153] Corynebacterium species host cells, in which to express metabolic 

polypeptides are constructed as follows. Polynucleotides of the present invention 
which comprise transcriptional regulatory regions related to SEQ ID NOs 1-23, 
26-28, and 30 are inserted upstream of a multiple cloning region, thereby 
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allowing convenient insertion coding regions to be regulated and expressed. 
Testing of the various transcriptional regulatory regions is conveniently carried 
by inserting the various regulatory regions in operable association to a known 
reporter gene, such as P-galactosidase (lacZ). Methods and techniques common 
to the art of recombinant DNA technology are used in making vectors of the 
invention, as may be found in the many laboratory manuals cited and incorporated 
herein, for example as found in J. Sambrook, E.F. Fritsch and T. Maniatis, 
Molecular Cloning: A Laboratory Manual, 2d ed., Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York (1989). Construction of 
exemplary vectors is set forth in Example 2, infra. 

For cell transformation experiments with the isolated nucleic acid 
molecules of the invention, the growth and preparation of competent cells may 
be done according to the following procedure: (1) picking a fresh, single colony 
of Corynebacterium glutamicum and growing a culture overnight in 10 mL CM 
(SMI) in a 250 mL shake flask at 30 degrees Celsius with agitation; (2) 
inoculating 200 mL of "Growth Media" with the overnight culture to an optical 
density (O.D.) of 660 nm of 0.1 in a 500 mL shake flask; (3) growing the culture 
at 30 degrees Celsius with agitation for 5-6 hours; (4) pouring the culture into a 
chilled, sealed, sterile 250 mL centrifuge bottle; Spin at 8-1 OK for ten minutes 
in Refrigerated Sorvall at 4 degrees Celsius; (5) pouring off the supernatant 
thoroughly and resuspending the cell pellet in an equal volume of ice-cold, 
sterile, deionized water; (6) centrifuging the sample again under the same 
conditions; (7) repeating the water wash remembering to keep everything 
ice-cold; (8) pouring off the supernatant thoroughly and resuspending the cell 
pellet in 1 mL of ice-cold, sterile 10% glycerol and transferring the cells to a 
chilled, sterile, 1 .5 mL microcentrifuge tube; (9) spin the sample for 10 minutes 
in a refrigerated centrifuge; (10) pipetting off and discarding the supernatant, and 
resuspending the pellet in two to three times the pellet volume (200-400 uL) of 
10% glycerol; and (1 1) aliquoting, if necessary, the cells into chilled tubes and 
freezing at -70 Celsius. 
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[0155] Plasmid DNAs are introduced into Corynebacterium glutamicum host 

cells by the following electroporation procedure : ( 1 ) pipetting 3 5 uL cell/glycerol 
solution onto the side wall of a chilled 0.1 cm electrocuvette; (2) pipetting about 
2-4 [iL of plasmid into the solution and mixing the sample by gentle pipetting up 
and down; (3) bringing the entire solution to the bottom of the electrocuvette by 
gentle tapping, avoiding the creation of bubbles; (4) keeping the sample on ice 
until ready for the electroshock step, wiping off any moisture on the outside of the 
electrocuvette prior to the electroshock administration, and shocking the cells one 
timeatl.5kV, 200n,25uF. 

[0156] Cells are allowed to recover from electroporation by: (1) immediately 

pipetting 1 mL of warm "Recovery Media" into the electrocuvette and thoroughly 
mixing the solution by pipetting; (2) incubating the solution (in the 
electrocuvette) at 30 degrees Celsius for at least three hours for antibiotic 
resistance expression and cell recovery and (3) plating on selection media and 
incubating at 30 degrees Celsius for 3 days. 

EXAMPLE 2 

Preparation of L-Lysine Pathway Constructs Utilizing the Regulatory Regions 
of the Present Invention 

Isolated nucleic acid molecules encoding L-lysine amino acid biosynthesis 
pathway genes are isolated by methods known to those of ordinary skill in the art. 

Constructs which facilitate expression and regulation of L-lysine amino 
acid biosynthesis pathway genes are introduced into Corynebacterium strains by 
methods such as those described in Example 1 . Any strain of Corynebacterium, 
particularly that of Corynebacterium glutamicum, may be utilized for the isolation 
of nucleic acid molecules for use in the preparation of vectors to improve amino 
acid biosynthetic pathway gene expression. Particularly preferred strains include: 
NRRL-B11474, ATCC 21799, ATCC 21529, ATCC 21543, and E12. As one 
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skilled in the art would know, the invention is not limited to these specific strain 
origins. 

[0159] Methods and techniques common to the art of recombinant DNA 

technology are used in making vectors of the invention, as may be found in the 
many laboratory manuals cited and incorporated herein. 

[0160] The polymerase chain reaction (PCR) technique may be used in the 

making of vectors of the invention. In a typical reaction, the standard 10X stock 
solution (100 mM Tris-HCL, pH 8.3, 500 mM KCL, 1.5 mM MgCl 2 ) is diluted 
to IX for use. Typical reaction conditions were used for PCR amplication: 10 
mM Tris, pH 8.3 , 50 mM KC1, 1.5 mM MgC12, 0.01% gelatin, 200uM 
deoxynucleotides, 0.2-1 .OuM primers and 2.0 U/l OOul pfu polymerase. Standard 
cycling parameters were also employed in PCR reactions: For 30 cycles, template 
denaturation was performed at 94 ° C for 3 0 sec . ; 6 5 ° C annealing temperature was 
performed for 1 min (or annealing temperature appropriate for particular primer 
pair); product extension was performed at 72 °C for 1 min (if product is <500 bp), 
3 min (if product is >500 bp); and at the end of cycling, a final extension at 72 °C 
for 7 min was performed. 

(a) Construction of pTrcLacZ 

[0161] A plasmid construct designed to replicate as a plasmid in 

Corynebacteriwn, which contains the (3-galactosidase (lacZ) coding region 
operably associated with the lacIVtrc regulatory region, was constructed as 
follows. Plasmid pZap2, a plasmid which is capable of replicating in 
Corynebacterium, was digested with Xbal and Smal, which linearize the plasmid 
just upstream of the lacZ gene. A cassette containing lacl q and the trc promoter 
was excised from plasmid pTrc99A, available from Amersham Pharmacia 
Biotech, by digestion with Sphl, filling in the overhang, and then digesting with 
Xbal. The 1517-bp cassette (SEQ ID NO:30) was isolated from an agarose gel, 
and ligated into the linearized pZap2 plasmid. The resulting ligation mix was 
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used to transform Escherichia coli, and colonies were isolated from LB plates 
supplemented with ampicillin (100 u-g/ml), Xgal, and IPTG (LBXIA medium). 
Blue colonies were selected and plasmid isolates were verified by restriction 
digestion and electrophoresis. 

(b) Construction of pAraBADLacZ 

[0162] A plasmid construct designed to replicate as a plasmid in 

Coryne bacterium, which contains the lacZ coding region operably associated 
with the araC/araBAD regulatory region, was constructed as follows. Plasmid 
pZap2 was digested with Xbal and Smal as in (a). A cassette containing the araC 
gene and the pBAD promoter was excised from plasmid pBAD 1 8-Cm, available 
from the American Type Culture Collection, Catalog No. 87396, by digestion 
with CM followed by a fill-in reaction and then digestion with Xbal. The cassette 
(i.e., nucleotides 1 to 1338 of SEQ ID NO:23) was isolated from an agarose gel 
and was ligated into the linearized pZap2 plasmid. Transformed E. coli colonies 
were plated on LBXIA medium as in (a), blue colonies were selected, and 
plasmid isolates were verified by restriction digestion and electrophoresis. 

(c) Construction of pAceB10Zap2 

[0163] A plasmid construct designed to replicate as a plasmid in 

Corynebacterium, which contains the lacZ coding region operably associated 
with the C. glutamicum aceB regulatory region, was constructed as follows. A 
cassette containing the aceB promoter was PCR amplified from plasmid pSLD5 
using forward primer 5' GTGC GGATCC TGGCTTTCCA ACGTTT 3' (SEQ ID 
NO:3 1 , BamRl site underlined) and reverse primer 5' CATGGG CTCGAGATGA 
CCTGTGCCTA 3' (SEQ ID NO:32, Sacl site underlined). The resulting PCR 
product was digested with BamHI and Sacl and ligated into the polylinker region 
of cloning vector pDll to produce pD 11 aceB. The resulting plasmid was 
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screened for orientation, and then the aceB promoter cassette was released from 
the vector by digestion with Kpnl and Xbal and cloned into the Kpnl and Xbal 
sites of plasmid pZap2 as described in (a). Transformed E. coli colonies were 
plated on LBXIA medium as in (a), blue colonies were selected, and plasmid 
isolates were verified by restriction digestion and electrophoresis. 

(d) Construction of 1 3 1 aceB/ask 

[0164] A plasmid construct designed to integrate into the C. glutamicum 

chromosome, which contains the C. glutamicum aspartokinase (ask) coding 
region operably associated with the C. glutamicum aceB regulatory region, was 
constructed as follows .' Plasmid pBGS 1 3 1 (described in Spratt, B.G.etal. Gene 
47:337-342 (1986)) was digested with Sma\. The aceB cassette was released 
from plasmid pD 11 aceB, produced as in (c), by digestion with appropriate 
restriction enzymes and the cassette was subjected to a fill-in reaction to produce 
blunt ends. The cassette was then ligated into the digested pBGS13 1 . Plasmid 
isolates were tested for the proper orientation of the aceB promoter by restriction 
digestion and electrophoresis, resulting in plasmid 1 3 1 aceB. The C. glutamicum 
ask coding region was released from pTrc99Aa^A: (produced as in (f), infra) by 
digestion with Sad and Kpnl and cloned into the SacI and Kpnl sites of 1 3 1 aceB, 
to produce \3\aceBask. 

(e) Construction of PD 1 OaceBask 

[0165] A plasmid construct designed to replicate as a plasmid in C. glutamicum, 

which contains the C. glutamicum aspartokinase (ask) coding region operably 
associated with the C. glutamicum aceB regulatory region, was constructed as 
follows . A cassette containing the ask coding region operably associated with the 
aceB regulatory region was released from \2>\aceBask, produced as in (d), by 
digestion with BamHl and Kpnl. The resulting fragment was cloned into the 
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BamHl and Kpnl sites of plasmid pDIO, which is capable of replicating in C. 
glutamicum, to produce pDlOaceBask. 

(f) Construction of 1 3 1 Lacl q /trc-ask 

[0166] A plasmid construct designed to integrate into the C. glutamicum 

chromosome, which contains the C. glutamicum aspartokinase (ask) coding 
region operably associated with the laclVtrc regulatory region, was constructed 
as follows. The PCR-amplified ask coding region was digested with Sad and 
Kpnl, and ligated into plasmid pTrc99A which had been digested with Sacl and 
Kpnl to produce pTrc99Aas&. A cassette containing the ask coding region 
operably associated with the laclVtrc regulatory region was then released from 
pTrc99Aas , &by digestion with Sphl, and was cloned into the Sphl site of plasmid 
pBGS131 to produce 13lLa.cl q /trc-ask. 

(g) Construction of Additional LacZ Plasmid Expression Constructs 

[0167] Plasmid constructs designed to replicate as aplasmid in Coryne bacterium, 

which contain the lacZ coding region operably associated with C. glutamicum 
regulatory regions contained in SEQ ID NOs 1, 2, and 4-22, are constructed as 
follows. Cassettes containing the regulatory regions present in SEQ ID NOs 1, 
2, and 4-22 are PCR amplified from the C. glutamicum chromosome using 
primers engineered to introduce appropriate restriction enzyme recognition sites 
onto the ends of the cassettes. In designing the primers, care is taken to introduce 
sites for restriction enzymes which will not digest the cassette internally. The 
resulting PCR products are digested with the chosen restriction enzymes, and the 
fragments are ligated into a polylinker region of a standard cloning vector. The 
resulting plasmids are screened for orientation, and then the cassettes are released 
from the vector by digestion with appropriate restriction enzymes, filled in as 
necessary, and cloned into the polylinker region upstream of the lacZ gene in 
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plasmid pZap2 as described in (a). Transformed E. coli colonies are plated on 
LBXIA medium as in (a), blue colonies are selected, and plasmid isolates are 
verified by restriction digestion and electrophoresis. 

(h) Construction Additional Aspartokinase Expression Constructs Designed 
to Integrate 

[0168] Plasmid constructs designed to integrate into the C. glutamicum 

chromosome, which contain the C. glutamicum aspartokinase (ask) coding region 
operably associated with C. glutamicum regulatory regions contained in SEQ ID 
NOs 1, 2, and 4-22, are constructed as follows. Plasmid pBGS131 is digested 
with Smal, or other appropriate restriction enzyme. The regulatory cassettes 
prepared as in (g) are released by digestion with appropriate restriction enzymes 
and the cassettes are subjected to fill-in reactions to produce blunt ends as needed. 
The cassettes are then ligated into the digested pBGS13 1. Plasmid isolates are 
tested for the proper orientation of the regulatory regions by restriction digestion 
and electrophoresis, to produce plasmids 1 3 1 SEQ ID NO: 1 , 1 3 1 SEQ ID NO:2, 
and 1 3 1 SEQ ID NO :4- 1 3 1 SEQ ID NO :22 . The C. glutamicum ask coding region 
is released from pTtc99Aask (produced as in (f)) by digestion with Sacl and Kpnl 
(or other appropriate restriction enzymes), or is PCR amplified from a C. 
glutamicum chromosome, and is cloned into the corresponding sites of plasmids 
1 3 1 SEQ ID NO: 1 , 13 1 SEQ ID NO:2, and 1 3 1 SEQ ID NO:4- 1 3 1 SEQ IDNO:22, 
to produce the plasmids 131SEQ ID NO:l-asfc, 131 SEQ ID NO:2-cw*, and 
131 SEQ ID ^OA-ask through 1 3 1 SEQ ID NO:22-ask. 

(i) Construction of Additional Aspartokinase Constructs Designed to 
Replicate as a Plasmid 

[0169] Plasmid constructs designed to replicate as a plasmid in C. glutamicum, 

which contain the C. glutamicum aspartokinase (ask) coding region operably 
associated with C. glutamicum regulatory regions contained in SEQ ID NO 1,2, 
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and 4-22, are constructed as follows. Cassettes containing the ask coding region 
operably associated with the regulatory regions contained in SEQ ID NO : 1 , SEQ 
ID NO:2, and SEQ ID NO:4-SEQ ID NO:22 are released from plasmids 131 SEQ 
ID NO:l-asJfc, 131 SEQ ID NO:2-as£, and 131SEQ ID NO A-ask through 131 
SEQ ID NO:22-<zs&, produced as in (h), by digestion with appropriate restriction 
enzymes. The resulting fragments are cloned into corresponding sites of plasmid 
pDIO, which is capable of replicating in C. glutamicum, to produce plasmids 
PD 1 0/SEQ ID NO: 1 -ask, PD 1 0/SEQ ID NO:2-as£, and PD 1 0/SEQ ID NO:4-a^ 
through PD10/SEQ ID NO:22-ask. 

EXAMPLE 3 

Use of LacZ as a Reporter to Demonstrate Transcriptional Regulation by 
araBAD (SEQ ID NO:23) and the lacl q /trc Transcriptional Complex (SEQ ID 
NO:30) in Corynebacterium (41-004002) 

[0 1 70] The ability of the heterologous transcriptional regulatory regions lacFV trc 

and araBAD (with the araC repressor) to control gene expression in C. 
glutamicum was tested by measuring expression of p-galactosidase. Plasmids 
pTrcZacZ and pAraBADZacZ, produced as in Examples 2(a) and 2(b), 
respectively, were transferred into C. glutamicum according to the methods 
described in Example 1 . The lacZ reporter gene, regulated by either of the two 
regulatory regions, was expressed from a plasmid replicating in C. glutamicum, 
grown in the presence of an inducer, IPTG in the case of the fcrcl q /trc regulatory 
region, or arabinose in the case of the araCI araBAD regulatory region. The level 
of p-galactosidase production was measured, and the results are shown in Table 3. 
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TABLE 3 



regulator/promoter 


Inducer 


nmol/min/mg 






Mean 


S.D. 


none 


none 


0.40 


0.14 


LacIVtrc 


none 


3.84 


0.57 


LaclVtrc 


+IPTG (30mg/L) 


77.23 


0.32 


AraC/AraBAD 


none 


0.47 


0.49 


AraCIAraBAD 


+arabinose (1 g/L) 


5.51 


0.46 



[0171] The trc regulated construct shows uninduced activity. The araBAD 

regulated construct showed no uninduced activity and responded to arabinose. 
The araBAD promoter resulted in a (3-galactosidase expression level that was 14- 
fold lower than that observed with the trc promoter in this organism. 



EXAMPLE 4 

Use of p-Galactosidase Activity to Demonstrate the Response of the C. 
glutamicum aceB Promoter to Acetate, Glucose and Ethanol (98-0040002) 



[0172] The ability of the C. glutamicum aceB regulatory region (SEQ ID NO:3) 

to control gene expression of a heterologous reporter gene in C. glutamicum was 
tested by measuring expression of (3-galactosidase from plasmid p^4ceB10Zap2, 
produced as in Example 2(c). In this construct, the lacZ reporter gene, regulated 
by aceB, is expressed from a plasmid replicating in C. glutamicum. Response of 
the regulatory region to various inducers was tested. The results are shown in 
Table 4. 



TABLE 4 



Rich Medium 


nmol/min/mg 


Minus sucrose 


10 


+ 2% glucose 


3 


+ 0.2% ammonium acetate 


26 


+ 2% ammonium acetate 


50 


+ 2% glucose + 0.2% ammonium acetate 


7 


+ 2% glucose + 2% ammonium acetate 


15 


+ 2% ethanol 


28 



[0173] Glucose repressed reporter activity. Ethanol induced activity but not as 

strongly as acetate. The uninduced control also showed activity implying that 
aceB is self-regulated in this medium. 



EXAMPLE 5 

Use of the trc Promoter to Regulate Aspartokinase Activity when Integrated 
into the Chromosome of C. glutamicum (90-004002) 



[0174] The ability of the lad%rc (SEQ ID NO:30) regulatory region to control 

expression of aspartokinase, an enzyme in the C. glutamicum L-lysine 
biosynthesis pathway (encoded by the ask gene), was tested as follows. The 131- 
2 and 131-5 C. glutamicum strains have the ladVtrc transcriptional regulatory 
region, in operable association with the ask gene, integrated into the C. 
glutamicum chromosome. These strains were prepared by introducing plasmid 
131Laci q /frc-asA: into C. glutamicum by the methods described in Example 1. 
The level of aspartokinase activity was measured both with and without the 
addition of the inducer IPTG. The results are shown in Table 5. 
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TABLE 5 



Strain 


Regulator/promoter- 
gene 


Inducer 


nmol/min/mg 


131-2 


lacPltrc-ask 


none 


59 


131-2 


lacFltrc-ask 


+IPTG (30 mg/L) 


117 


131-5 


lacFltrc-ask 


none 


59 


131-5 


lacFltrc-ask 


+IPTG (30 mg/L) 


123 



[01 75] The level of aspartokinase activity is doubled upon induction when the trc 

regulon is integrated into the chromosome. 

EXAMPLE 6 

Lysine Production by Integrated, Uninduced lacl q /trc-ask Constructs 
(84-004002) 

[0176] This example shows the influence on lysine production of the uninduced 

lacF/trc-ask construct when integrated into the chromosome of C. glutamicum. 
The lacIVtrc regulatory region was used to control expression of aspartokinase 
(encoded by the ask gene), an enzyme in the C. glutamicum L-lysine biosynthesis 
pathway. Bacterial cultures of 131-2 and 131-5, prepared as described in 
Example 5, were grown in shake flasks. BF100 is C. glutamicum strain which is 
a high level lysine producer. The level of aspartokinase activity was measured 
without the addition of an inducer. The results are shown in Table 6. 
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TABLE 6 



Strain 


Induction 


O.D. 


Titre 


Yield 


S.P. 


BF100 


none 


43 


23 


39 


53 


131-2 


none 


34 


27 


46 


82 


131-5 


none 


35 


28 


47 


82 





O.D. = optica l density at 660nm 

Titre = grams Lysine /liter 

Y ield = grams lysine made/grams dextrose consumed 
S.P. = grams lysine/O.D. 



[0 1 77] The leakiness of the trc promoter in this organism may be responsible for 

the improved productivity in the absence of an inducer. 



EXAMPLE 7 

Improvement of Lysine Production in Shake Flasks by aceB-Aspartokinase 
Constructs with No Exogenous Induction (84-0040002) 



[0178] This example shows the influence on lysine production of an aceB- 

Aspartokinase construct when integrated into the C. glutamicum chromosome. 
The C. glutamicum aceB regulatory region (SEQ ID NO:3) was used to control 
expression of aspartokinase (encoded by the ask gene), an enzyme in the C. 
glutamicum L-lysine biosynthesis pathway. Bacterial cultures were grown in 
shake flasks. Strain 131-6 is a C. glutamicum strain which has the aceB 
transcriptional regulatory region in operable association with the ask gene, 
integrated into the bacterial chromosome. Strain 131-6 was prepared by 
introducing plasmid 13 laceBf ask, produced as described in Example 2(d), into 
C. glutamicum by the methods described in Example 1 . BF 1 00 is C. glutamicum 
strain which is a high level lysine producer. The level of aspartokinase activity 
was measured either with or without the addition of the inducer acetate. The 
results are shown in Table 7. 
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TABLE 7 



Strain 


Induction 


O.D. 


Titre 


Yield 


BF100 


none 


43 


25 


37 




acetate 


42 


24 


36 


131-6 


none 


50 


27 


41 




acetate 


47 


27 


40 



Titer = g lysine/L medium 



Yield = g lysine/ g glucose consumed * 100% 

[0179] Induction by ammonium acetate is not required to observe an 

improvement in lysine production by this construct. 

EXAMPLE 8 

Improvement of Lysine Production in Fermentors by aceB-aspartokinase 
Constructs with No Exogenous Induction (Lys002 Lys006RLK) 

[0180] This example shows the influence on lysine production of uninduced 

aceB-Aspartokinase constructs which are expressed from a plasmid 
(VDlOaceBask) replicating in C. glutamicum, or which are integrated 

[0181] into the C. glutamicum chromosome as in strain 131-6, described in 

Example 7. The C. glutamicum aceB regulatory region (SEQ ID NO:3) was used 
to control expression of aspartokinase (encoded by the ask gene), an enzyme in 
the C. glutamicum L-lysine biosynthesis pathway. Bacterial cultures were grown 
in fermentors. The C. glutamicum 1 3 1 -aceB-aspartokinase strain, 131-6, has the 
aceB transcriptional regulatory region in operable association with the ask gene, 
integrated into the C. glutamicum chromosome. The C. glutamicum PD1 0-aceB- 
aspartokinase strain has the aceB transcriptional regulatory region in operable 
association with the ask gene, expressed from a plasmid (PD 1 OaceBask, produced 
as described in Example 2(e)) replicating in C. glutamicum. BF100 is C. 
glutamicum strain which is a high level lysine producer. Aspartokinase activity 
was measured in each of these three strains.The results are shown in Table 8. 
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TABLE 8 



Strain 


Titer 


Yield 


Product 


BF100 


127 


32.9 


735 


1 3 1 -aceB-aspartokinase 


129 


39.8 


833 


BF100 


138 


40.7 


769 


PD 1 0-aceB-aspartokinase 


135 


45.8 


830 



Titer = g lysine/L medium 

Yield = g lysine/g glucose consumed * 100% 

Product = g lysine made in that fermented 



[0182] The aceB-aspartokinase constructs contributed to productivity whether 

they were maintained on a plasmid or integrated into the chromosome. In neither 
case did the constructs need to be induced by an exogenous compound. 

EXAMPLE 9 

Influence of Medium on Beta Galactosidase Activity Using Various Use of 
LacZ as a Reporter to Demonstrate Transcriptional Regulation by the 
Regulatory Regions Contained in SEQ ID NOs 1,2, and 4-22 in 
Corynebacterium 

[0183] The ability of certain transcriptional regulatory regions disclosed herein 

to control gene expression in C. glutamicum was tested by measuring expression 
of (3-galactosidase. The promoters used include (SEQ ID NO: 27) Tac, (SEQ ID 
NO: 1) Pta, (SEQ ID NO: 2) AceA, (SEQ ID NO: 3) AceB, (SEQ ID NO: 4) Adh, 
(SEQ ID NO: 5) AldB, (SEQ ID NO: 6) PoxB, (SEQ ID NO: 7) Ldh, (SEQ ID 
NO:9)MalZ,(SEQIDNO: 10) BglX, (SEQ ID NO: 13)HisD,(SEQIDNO: 14) 
PyrR, (SEQ ID NO: 1 5) PurD, (SEQ ID NO: 1 8) DnaK, (SEQ ID NO: 20) GrpE, 
(SEQ ID NO: 21) ClpB, (SEQ ID NO: 33) LeuA, (SEQ ID NO: 34) IlvA and 
(SEQ ID NO: 35) II vB. The reporter gene was expressed from plasmids 
replicating in C. glutamicum, and was driven by the regulatory regions contained 
in SEQ ID NOs 1,2, and 4-22. The plasmids were prepared as described in 
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Example 2(g). Increased expression of beta-galactosidase under the 
transcriptional control of these transcriptional regulatory regions is shown in 
Table 9. The "Seed" Medium is experimentally determined to support high 
growth. The "Main" Medium is formulated to induce higher expression of 
products, such as lysine, for example. Exemplary medias were described supra. 

For certain regulatory regions, P-galactosidase levels are measured 
without the addition or any inducer or in the presence of various inducers, e.g., 
acetic acid, pyruvate, ethanol, a starch subunit, a sugar, e.g., fructose, maltose, 
lactose or arabinose, a cellulose subunit, a fatty acid, or a triglyceride. In 
addition, certain regulatory regions are tested by growth with a suppressor, e.g., 
a purine, a pyrimidine, and amino acid, or oxygen, followed by a shift to medium 
lacking the suppressor at which time P-galactosidase levels were measured. For 
certain of the transcriptional regulatory regions, P-galactosidase levels are 
measured at optimal growth temperatures, and at temperatures either greater than 
or less than the optimal growth temperature. The p-galactosidase levels were 
measured using certain transcriptional regulatory regions at varying temperatures 
as shown in Table 10. 

TABLE 9 

Influence of Medium on beta-galactosidase Reporter Activity 
(units = nmol/min/mg protein) 

Medium 



SEO ID NO 


Promoter 


SeedC16hf> 


Mainf48hr> 


Change 




Promoterless 


2 


20 


19 


27 


Tac 


231 


342 


111 


1 


Pta 


134 


111 


-23 


2 


AceA 


0 


63 


63 


3 


AceB 


0 


25 


25 


4 


Adh 


0 


65 


65 



-74- 



5 


AldB 


9 


260 


251 


6 


PoxB 


150 


276 


126 


7 


Ldh 


205 


291 


86 


9 


MalZ 


62 


234 


172 


10 


BglX 


57 


208 


151 


13 


HisD 


119 


241 


122 


14 


PyrR 


161 


289 


128 


15 


PurD 


9 


56 


47 


1 G 


DnaK 


61 


223 


162 


20 


GrpE 


40 


149 


109 


21 


ClpB 


104 


233 


129 


33 


LeuA 


174 


267 


193 


34 


IlvA 


120 


229 


109 


35 


IlvB 


22 


166 


144 



TABLE 10 

Effect of Temperature Change on beta-galactosidase Reporter Activity 
(units = nmol/min/mg protein; mean (standard deviation)) 
temperature shift in seed medium for 4 hrs 



SEO ID NO 


Promoter 


30°C 


36 °C 


40 °C 




Promoterless 


0(0.7) 


0(0.2) 


0(0.7) 


21 


clpB 


76(7.3) 


123(2.5) 


165(1.6) 


18 


dnaK 


70(0.8) 


119(3.1) 


169(4.5) 


20 


grpE 


47(1.6) 


107(1.6) 


97(2.6) 



All 3 promoters increased reporter activity at 36°C 
All 3 promoters increased reporter activity at 40°C 
The grpE promoter doubled the activity at both 36°C and 40 °C 
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EXAMPLE 10 

Improvement of Lysine Production Through Transcriptional Regulation of 
Aspartokinase by Regulatory Regions Contained in SEQ ID NOs 1,2, and 4-22 

This example shows the influence on lysine production of induced or 
uninduced constructs in which the transcriptional regulatory regions of SEQ ID 
NOs 1,2, and 4-22 are used to express aspartokinase, either from a plasmid 
replicating in C. glutamicum, or when integrated into the C. glutamicum 
chromosome. The various C. glutamicum regulatory regions are used to control 
expression of aspartokinase, an enzyme in the C. glutamicum L-lysine 
biosynthesis pathway. Bacterial cultures are grown either in shake flasks or 
fermentors. C. glutamicum strains are prepared in which the transcriptional 
regulatory regions of SEQ ID NOs 1 ,2, and 4-22, in operable association with the 
ask gene, are integrated into the C. glutamicum chromosome. These strains are 
prepared by introducing plasmids 131 SEQIDNO:l-as,fc, 131 SEQIDNO:2-fl^, 
and 131SEQIDNO:4-as£through 131 SEQ ID NO:22-ay&, prepared as described 
in Example 2(h), into C. glutamicum by methods described in Example 1. 
Additional strains, with self-replicating plasmids are prepared by introducing 
plasmids PD10/SEQ ID NO:l-as*, PD10/SEQ ID NO:2-ask, and PD10/SEQ ID 
NO:4-a.y& through PD1 0/SEQ ID NO:22-ask, produced as described in Example 
2(i), into C. glutamicum by methods described in Example 1 . BF 1 00, a high level 
lysine producer, is used as a control. Aspartokinase activity is measured in each 
of the resulting strains. For certain regulatory regions, aspartokinase levels are 
measured without the addition or any inducer, or in the presence of various 
inducers, e.g., acetic acid, pyruvate, ethanol, a starch subunit, a sugar, e.g., 
fructose, maltose, lactose or arabinose, a cellulose subunit, a fatty acid, or a 
triglyceride. In addition, certain regulatory regions are tested by growth either 
without, or with a suppressor, e.g., a purine, a pyrimidine, and amino acid, or 
oxygen, followed by a shift to medium lacking the suppressor at which time 
aspartokinase levels are measured. Finally, for certain regulatory regions, 
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aspartokinase levels are measured at optimal growth temperatures, and also at 
temperatures either greater than or less than the optimal growth temperature. 

***** 

[0186] Having now fully described the present invention in some detail by way 

of illustration and example for purposes of clarity of understanding, it will be 
obvious to one of ordinary skill in the art that same can be performed by 
modifying or changing the invention with a wide and equivalent range of 
conditions, formulations and other parameters thereof, and that such 
modifications or changes are intended to be encompassed within the scope of the 
appended claims. 

[0187] All publications, patents and patent applications mentioned in this 

specification are indicative of the level of skill of those skilled in the art to which 
this invention pertains, and are herein incorporated by reference to the same 
extent as if each individual publication, patent or patent application was 
specifically and individually indicated to be incorporated by reference. 



